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IMPROVED CONSTRUCTS FOR MARKER EXCISION BASED ON DUAL- 
FUNCTION SELECTION MARKER 

FIELD OF THE INVENTION 

The inMantlon relates to Improved construe* and methods .^"l^^^^^^a "^^^ 
quences from the genome of plants, based on duaMunction seledion marter 
depending on the employed compound - can act as both negative and counter- 
selection marker. 

BACKGROUND OF THE INVENTION 

An aim of pJant biotechnology is the generation of plants vwlth advantageous novel 
S^aSSei^cs. for example for increasing agricultural ^^^S 
IS ity in foodstuffs or for the production of certain chemicals or pharmaceuticals (Dunwell 
JM (2000) J Exp Bot 51:467-96). 

There are various methods described in art for Inserting heterogenous DNlA sequen^s 
7nto the genome of a host cell or organism. Because of ' ^^""^^^^ 

oenerallv required to employ selection marker to select for cells or organisms whK:n 
Cni^cesllly incorporated the transgenic construct 

transgenic cells or organisms (e.g-, plants or plant cells) to be idenfafied after transfoj- 
rr^n T?,^ In be divided into positive selection marker (confemng a selective ad- 

Se)^^Sv^^^^^^ marker (-"^ taranLt^^^^^ 

counter-selection marker (conferring a selection disadvantage), respectively. 

Neaative selection maiicers are most often employed in methods for P«d««na 
S^te^lls or onanisms. Such negatwe selection markers confer for example a res^ 
SKS te a SocSal compound such as a metabolic Inhibitor (e.g., 2-deoxyglucose^- 
ohoToJate WO 98/4^6). antibiotics (e.g., kanamydn. G 418. bleomycin orj.>,aromy^ 
dnror he^i^des ^ phosphinothridn or glyphos^e). Examples - especially suitable 
for plant transformation - are: 

. Phosphinothricin acetyltransferases (PAT; also named Blalophos *resistance; bar; 
ninrk 1 ea7- CP 0 333 033: US 4,975,374) 

naiaoon^ inactivating dehalogenases (deh) ^^r«r.i-. 
: sd onyrur^- and imidazolinonennactivatlng acetolactate syngases (for example 
mSed variants with, for example, the S4 and/or Hib mutation 

: 0^04^8? ~ci1ets (NPTl, NPTl) coding e.g.. for neomycin 

phosphotransferases ^"^^^J^^^.^^^ (DOG-^I-Gene product: WO 

. ^gZy* phosphatransre«» (HPT), «Wch mediates resistance to hyflrcmydn 

(Vanden Elzen 1985). 
. dihydrof elate reductase (Eichholtz 1 987) 

14. p. Fig + Seq 
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D-amino add metabolizing enzyme (e.g., D-amino acid dehydratases or oxidases; 
WO 03/060133) 



Additional negative selectable marker genes of bacterial origin that confer resistance to 
5 antibiotics include the aadA gene, which confers resistance to the antibiotic spectino- 
mydn, gentamycin acetyl transferase, streptomycin phosphotransferase (SPT). ami- 
noglycoside-3-adenyl transferase and the bleomycin resistance determinant (Hayford 
1988; Jones 1987; Svab 1990; Hille 1986).. 

1 0 Additional selection markers are those which do not result in detoxification of a biocidal 
compound but confer an advantage by increased or improved regeneration, growth, 
propagation, multiplication as the like of the cell or organism comprising such kind of 
**pos{tive selection marker". Examples are isopentenyltransferase (a key enzyme of the 
cytokinin biosynthesis facilitating regeneration of transformed plant cells by selection 

15 on cytokinin-free medium; Ebinuma 2000a; Ebinuma 2000b). Additional positive selec- 
tion markers* which confer a growth advantage to a transformed plant cells In compari- 
son with a non-transfomied one, are described e.g., in EP-^A 0 601 092. Growth stimu- 
lation selection markers may include (but shall not be limited to) p-Glucuronidase (in 
combination with e.g., a cytokinin glucuronide), mannose-6-phosphate isomerase (in 

20 combination with mannose), UDP-galactose-4-epimerase (in combination with e.g., 
galactose). 

The selectable marker gene is useful during the transformation process to select for, 
and Identify, transformed organisms, but typically provides no useful function once the 

25 transformed organism has been identified and contributes substantially to the lack of 
acceptance of these "gene food" products among consumers (Kuiper iHIA et al. (2001) 
Plant J. 27, 503— 528), and few markers are available that are not based on these 
mechanisms (Hare P & Chua NH (2002) Nat BiotechnoL 20, 575-680). Thus, there is 
a demand for new markers for both research and commercial crop production. Aiterna- 

30 tively, there are multiple attempts to develop techniques by means of which marker 
DMA can be excised from plant genome (Ow DW and IMedberry SL (1 995) Crit Rev in 
Plant Sci 14:239-261; Gleave AP et al. (1999) Plant MoL BiaL 40, 223-23 ). 

The person skilled in the art is familiar with a variety of systems for the site-directed 
35 removal of recombinantly introduced nucleic add sequences. They are mainly based 
on the use of sequence specific recombinases. Various sequence-specific recombina- 
tion systems are described, such as the Cre^ox system of the bacteriophage Pi (Dale 
EC and Ow DW (1991) Proc Natl Acad Sci USA 88:10558-10562; Russell SH et aL 
(1992) Mol Gene Genet 234: 49-59; Osborne Bl et al. (1995) Plant J. 7, 687-701), the 
40 yeast FLP/FRT system (Kilby NJ et al. (1995) Plant J 8:637-652; Lyznik LA et al. 
(1 996) Nucleic Acids Res 24:3784-3789), the Mu phage Gin recombinase, the E. coli 
Pin recorhbinase or the R/RS system of the plasmid pSRI (Onouchi H et aL(1995) Mol 
Gen Genet 247:653-660; Sugita Ket aL (2000) Plant J. 22:461-469). . 

45 Zubko et aL (Nature Biotech (April 2000) 1 8{4):442-^5) describe a system for the dele- 
tion of nucleic acid sequences, where the sequence to be deleted is flanked by two 352 
bp attP recognition sequences from the bacteriophage Lambda. Deletion of the flanked 
region^takes place independently of the expression of helper proteins in two out of 
eleven transgenic tobacco lines by spontaneous intrachromosomal recombination be- 

50 tween the attP recognition regions. 
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WO 02/2B071 discloses a method for conditional excision of transgenic sequences 
from the genome of a transgenic organism. Excision occurs directly by action of an 
enzyme (e.g., a recombrnase or a endonudease). Self-exdsing constructs based on a 
site^specific recombinase are described in WOd7/037012 and WO02/1041S. 

S 

WO 03/004659 describes a recombination system based on homologous recombina- 
tion between two homologous sequences induced by action of a sequence specific 
double-strand break inducing enzyme, preferably a meganuclease (homing- 
endonudease). 

10 

Since also the marker exdsion efficiency is most often low, the systems are most often 
combined with counter-selection marker. These are sequences encoding for enzymes 
which are able to convert a non-toxic compound into a toxic compound. In conse- 
quence, only cells will survive treatment with said non-toxic compound which are lack- 
15 ing said counter-selection marker, thereby allowing for selection of cells which have 
successfully undergone ' sequence (e.g.. marker) deletion. Typical counter-selection 
markers known In the art are for example 

a) cytosine deaminases (CodA) in combination with 5-fluorocytoslne (5-FC) (WO 
20 93/012B1; US 5,358,866; Gleave AP et al. (1999) Plant Mol Biol 4Q(2):223-3S; Per- 

era RJ et al. (1993) Plant Mol Biol 23(4):793-79g: Stougaard J (1993) Plant J 3:755- 
761); EP-A1 595 837; Mullen CA et al. (1992) Proc Natl Acad Sci USA Bg(1):33-37: 
Kobayashi T et al. (1995) Jpn J Genet 70(3):409^22^ Schlaman HRM & Hooykaas 
PFF (1997) Plant J 11:1377-1385; Xiaohui Wang H et aU (2001) Gene 272(1-2): 

25 249-255; KoprekTetaL (1999) Plant J 19(S):71 9-726; Gleave APetaL (1999) Plant 
Mol Biol 40(2):223-235; Gallego ME (1999) Plant Mol Biol 39(1):a3-93; Salomon S & 
Puchta H (1998) EMBO J 17(20):6Q86-6095; Thykjaer T et al. (1997) Plant Mol Biol 
35(4):523-530; Serine G (1997) Plant J 12(3):697-701; Risseeuw E (1997) Plant J 
11(4):71 7-728; Blanc V et al. (199S) Biochimie 78(6):S11-S17; Comeille S et al 

30 (2001) Plant J 27:171-178). 

b) Cytochrome P-4S0 enzymes in combination with the sulfonylurea pro-herbidde 
R7402 (2-methylethyl-2-3-dihydra-N-[(4,6-dlmethoxypyrimidine-2-yl)aminocarbonyll- 
1,2-ben20isothiazol-7-sulfonamid-1,1 -dioxide) (O'Keefe DP et ai. (1994) Plant 

35 Physiol 105:473-482; Tissler AF et al. (1999) Plant Ceii 11:1841-1862; Koprek T 
et al. (1999) Plant J 19(6):71 9-726; O'Keefe DP (1991) Biochemistry 30(2):447-SS). 

c) Indoleacetic acid hydrolases lilce e.g., the tms2 gene product from Agrobacterium 
tumetadens in combination with naphthalacetamide (NAIVI) (Fedoroff NV & Smith 

40 DL (1993) Plant J 3:273-289; Upadhyaya NM et al. (2000) Plant Mol Biol Rep 
18:227-223; Depicker AG et al. (1988) Plant Cell rep 104:1067-1071; Kari^ln- , 
Neumannn GA et al. (1991) Plant Cell 3:573-582; Sundaresan V etal. (1995) Gene 
Develop 9:1797-1810; Cecchini E et al. (1998) Mutat Res 401 (1-2): 199-206; Zuhko 
E et al. (2000) Nat Biotec^nol 18:442-^5). 

45 

d) Haloalkane dehalogenases (dhIA gene product) from Xanthobacter autotropicus 
GJ10 in combination with 1 ,2-dichIoroethane (DCE) (Naested H et al. (1999) Plant J 
18(5)571-576; Janssen DB et al. (1994) Annu Ftev Microbiol 48: 163-191; Janssen 
DB (1989) J Bacterid 171(12):6791-9). 

50 
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e) Thymidine kinases (TX), e.g., from Type 1 Herpes Simplex virus (TK HSV-1), in 
combination with acyclovir, ganciclovir or 1 ,2-deoxy-2-fluoro-l>D-arabinofuranosjl-S- 
lodouradle (FIAU) (Czako M & Marton L (1994) Plant Physiol 104:1067-1071; 
Wigler M et ai. (1977) Cell 11(1):223-232: McKnight SL et al. (1980) NucI Adds Res 
S B(24):5949-S9S4: McKnight SL et al. (1980) Nud Adds Res a(24):S931-S948; Pre- 
ston et al. (1981) J Virol 38(2): 593-605; Wagner et aL (1981) Proc Natl Acad Sd 
USA 76(3): 1441 -1446; St Clair etaL(l987) Antimicrob Agents Chemother 
31(6):B44-840). 

10 Several other counter-selection systems are known In the art (see for example Intema- 
tional application WO 04/013333; p. 13 to 20 for a summary; hereby incorporated by 
reference). 

However, the selection systems directed to the production of marker-free cells or or- 
is ganisms described in the art so far requires two separate seledion-marker. 

1. first, a negative selection marker (e.g.» confemng resistance against a herbicide or a 
antibiotic), which allows for selection of cells which have incorporated the transfor- 
mation construct, and 

20 

2. second, a counter-selection marker (see above) which allows for selection of cells 
which have successfully undergone deletion/exdsion of the marker sequences. 

WO 03/060133 Is describing enzymes like the D-amino add oxidase from Rhodotorula 
25 gradlis. The toxic effect of certain amino acids can - depending on the amino acid - be 
lowered or increased by metebolization by e.g., a D-amino acid oxidase. 

There is so far no combined negatlve/counter-seleclion systems described in the art 
which are based on dual-ftinctlon marker making subsequent use of both its properties 

30 as a negative selection marker and a counter-selection marker. There is furthermore an 
unsatisfied demand - espedally in the plant biotechnology area - for fast transfonna- 
tion systems leading to marker-free plants. It is therefore an objective of the present 
invention to provide an efficient negative/counter-selection system which allows for fast 
generation of marker-free transgenic plant cells and plants and which Is based on a 

35 single dual-fiindlon marker. This objective has been achieved by the present invention. 
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BRIEF DESCRIPTION OF THE INVEMTION 

Accordingly, a first embodiment of the Invention relates to a method for producing a 
transgenic plant comprising: 

5 

i) transfomiing a plant cell with a first expression cassette comprising a nucleic acid 
sequence encoding a D-amino add oxidase operably linked with a promoter allow- 
ing expression in plant cells or plants, in combination with at least one second ex- 
pression cassette suitable for conferring to said plant an agronomically valuable 

10 trait, and 

ii) providing at least one frrst compound X, which is phytotoxlc against plant cells not 
functionally expressing said O-amino acid oxidase, wherein said compound X can 
be metabolized by said D-amino acid oxidase into one or more compound(s) Y 

1S which are non-phytotoxic or less phytotoxic than compound X, and 

iii) treating §aid transformed plant cells of step i) with said first compound X in a phyto- 
toxic concentration and selecting plant cells comprising In their genome both said 

. first and said second expression cassette, wherein said first expression cassette is 
20 confemng resistance to said transformed plant cells against said compound X by 
expression of said D-amIno acid oxidase, and 

Iv) providing at least one second compound M, which is non-phytotoxic or moderately 
phytotoxic against plant cells not functionally expressing said D-amino acid oxidase, 
25 wherein said compound M can be metabolized by said D-amino acid oxidase into 
one or more compound(s) N which are phytotoxic or more phytotoxic than com- 
pound M, and 

v) breaking the combination between said first expression cassette and said second 
30 expression cassette and treating resulting said plant cells with said second com- 
pound M in a concentration toxic to plant cells still comprising said first expression 
cassette, and selecting plant cells comprising said second expression cassette but 
lacking said first expression cassette. 

3S The fiist and the second expression cassette may not be combined on one DNA con- 
struct but may be employed in combination In form of - for example - a co- 
transfbmiation approach wherein the two separate molecules are transformed together 
into the plant cells. In a scenario like this the combination of the first and the second 
expression cassette can be broken e.g. by segregation (for example following repro- 

40 duction of resulting plantlets). In this scenario the multiplicity of resulting segregated 
transgenic plantlets can be easily screened for lack of the first expression cassette by 
employment of compound IM. 

However, the first and the second expression cassette may be combined on one DMA 
45 construct Here the combination can be broken for example by means of sequence 
specific sequence deletion or excision e.g., by employing a sequence-specific recom- 
binase or by Induced sequence specific homologous recombination. 
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Accordingly, a second embodiment of the invention relates to a method for producing a 
transgenic plant comprising: 

j) transforming a plant cell with a first DMA construct comprising 

a) a first expression cassette comprising a nucleic acid sequence encoding a D- 
amlno acid oxidase operably linked with a promoter allowing expression in plant 
cells or plants, wherein said first expression cassette Is flanlced by sequences 
which allow for specific deletion of said first expression cassette, and 

b) at least one second expression cassette suitable for confenring to said plant an 
agronomlcally valuable trait, wherein said second expression cassette is not lo- 
calized between said sequences which allow for specific deletion of said first ex- 
pression cassette, and 

ii) providing at least one first compound X, which is phytotoxic against plant cells not 
functionally expressing said D-amino acid oxidase, wherein said compound X can 
be metabolized by said D-amino acid oxidase into one or more compound(s) Y 
which are non-phytotoxic or less phytotoxic than compound X, and 

iii) treating said transformed plant cells of step i) with said first compound X In a phyto- 
toxic concentration and selecting plant cells comprising in their genome said first 
DlslA construct, conferring resistance to said transformed plant cells against said 
compound X by expression of said D-amino acid oxidase, and 

25 , ^ , 

iv) providing at least one second compound M, which is non-phytotoxic or moderately 

phytotoxic against plant cells not functionally expressing said D-amino acid oxidase, 
wherein said compound M can be metabolized by said D-amino acid oxidase Into 
one or more compound(s) N which are phytotoxic or more phytotoxic than com- 
30 pound M, and 

V) inducing deletion of said first expression cassette from the genome of said trans- 
formed plant cells and treating said plant cells with said second compound M in a 
concentration toxic to plant cells still comprising said first expression cassette. 
35 thereby selecting plant cells comprising said second expression cassette but lacking 
said fii^ expression cassette. 

in a prefen-ed embodiment the method of the invention further comprises the step of 
regeneration of a fertile plant. The method may further include sexually or asexually 
40 propagating or growing off-spring or a descendant of the plant regenerated from said 
plant cell. _ 

In another prefen-ed embodiment the first (phytotoxic) compound X is preferably com- 
prising a D-amino acid stmcture selected from the group consisting of D-tryptophane, 
45 D-histldlne. D-arginine, D-threonlne. D-methionine, D-serine. and D-alanine; more 
preferably D-alanine. D-serine, and derivatives thereof. Most preferably, X is compns- 
ing and/or consisting of D-alanine, D-Serine, or derivatives thereof. 

In another preferred embodiment the second (non-phytotoxic. but metabollzable into 
50 phytotoxic) compound M is preferably comprising a D-amino acid structure selected 
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from the group consisting of D-isoleudne, D-valine, D-asparagine. D-leucine. D-lysine, 
D-pnollne, and D-glutamine; more preferably D-isoleudne, D-valine, and derivatives 
thereof. Most preferably. IWi is comprising and/or consisting of D-isoleudne, D-valine, or 
derivatives thereof. 

5 

Preferably, deletion of the first expression cassette can be realfeed by various rneans 
Icnown in the art, induding but not limited to one or more of the following methods: 

a) recombination induced by a sequence specific recombinase, wherein said first ex- 
10 pression cassette is flanlced by con-esponding recombination sites in a way that re- 
combination between said flanking recombination sites results in deletion of the se- 
quences in-between from the genome. 

b) homologous recombination between homology sequences A and A' flanking said 
IS first expression cassette, preferably induced by a sequence-specific double-strand 

break between said homology sequences caused by a sequence specific endonu- 
clease, wherein said homology sequences A and A' have sufficient length and ho- 
mology in order to ensure homologous recombination between A and A', and having 
an orientation which - upon recombination between A and A - will lead to excision 
20 of said first expression cassette from the genome of said plant 

Various means are available for the person skilled in art to combine the dele- 
tion/excision indudng mechanism with the DMA construct of the invention comprising 
the D-amino add oxidase dual-function selection maricer. Preferably, a recombinase or 
2S endonudease employable in the method of the invention can be expressed by a 
method selected from the group consisting of: 

a) incorporation of a second expression cassette for expression of the recombinase or 
sequence-specific endonudease operably linked to a plant promoter into said DMA 

30 construct, preferably together with said first expression cassette flanked by said se- 
quences vMdh allow for spedfic deletion, 

b) incorporation of a second expression cassette for expression of tFie recombinase or 
sequence-specific endonudease operably linked to a plant promoter into the plant 

35 cells or plants used as target material for the transfonnation thereby generating 
master cell lines or cells, 

c) incorporation of a second expression cassette for expression of the recombinase or 
sequence-specific endonudease operably linked to a plant promoter into a separate 

40 DMA construct, whksh is transfonned by way of co-transformation with said first DNA 
construct into said plant cells, 

d) incorporation of a second expression cassette for expression of the recombinase or 
sequence-specific endonudease operably linked to a plant promoter into the plant 

45 cells or plants which are subsequently crossed with plants comprising the DNA con- 
struct of the invention. 

In another preferred embodiment the mechanism of deletlon/exdsion can be induced 
or activated in a way to prevent premature deletion/exdsion of the dual-function 
50 mariter Preferably, thus expression and/or activity of an preferably employed se- 
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quence-specific recombinase or endonudease can be induced and/or activated, pref- 
erably by a method selected from the group consisting of 



a) inducible expression by operably Unking the sequence encoding said recombinase 
5 or endonudease to an indudble promoter, 

W indudble activation, by employing a modified recombinase or endonudease com- 
prising a ligand-blnding-domain, wherein activity of said modified recombinase or 
endonudease can by modified by treatment of a compound having binding activity 
10 to said ligand-blnding-domafri- 

Preferably. thus the method of the Inventions results in a plant cell or plant which is 
selection marker-free. 

15 Another subject matter of the invention relates to DNA constructs which are suteWe for 
employing in the method of the invention- A DNA construct suitable for use within the 
present invention is preferably comprising 

a) a first expression cassette comprising a nudeic add sequence encoding a D-amino 
20 acid oxidase operably linked with a promoter allowing expression in plant cells or 
plants, wherein saW first expression cassette is flanked by sequences which allow 
for specific deletion of said first expression cassette, and 

at least one second expression cassette suitable for conferring to said plant an 
25 ' aaronomically valuable trait, wherein said second expression cassette is not locai- 
teed between said sequences whidi allow for specific deletion of said first expres- 
sion 



The D-amino acid oxidase expressed from the DNA-construd of the invention has 
30 preferably metabdising activity against at least one D-amino add and compnses a se- 
quences motive having the following consensus sequence: 

[UVWIHUVMl-H*-[NHAl-Y-G-X-[GSAHGSA]-x-G-Xs-G-x-A 

as wherein amino add residues given in brackets represent aHemative residues for the 
;!fsp^?ive po^^^^^^^^^^ X represent any amino add residue, and Indices numbers .nd.cate • 
the respective number of consecutive amino add residues. 

In an orefen-ed embodiment D-amino add oxidase expressed from the DNA-constmct 
40 S me^Son has preferably enzymatic activity against at least one of fhe amino ao- 
'° T^Tc^TlS^T^o^^oup Lsisting of D-alanlne. D-serine. D^soleuane^ D^^l.ne 

and derivatives thereof. Preferably said D-amino acd oxidase is selected from the 

group of amino add sequences comprising 

45 a) the sequences described by SEQ ID NO: 2. 4. 6. 8. 10. 12. and 14. and 

b) the sequences having a sequence homology of at least 40% PJ^^^^kv SEQ 
more preferably 80%. most pr^erably 95% with a sequence as descnbed by SEQ 
ID NO: 2, 4. 6, 8, 10. 12, and 14, and 

50 
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c) the sequences hybridizing under low or high stringency «f "^^j*"^ - Pf?!^?']}' 
under high stringency conditions - with a sequence as descnbed by SEQ id no. 2. 

4, 6. 8, 10. 12, and 14. 

5 For ensuring maricer deletion / excision the expression cassette for ttie D-amino add 
oxidase (the first expression construct) comprised in the DMA construct of the invenfaon 
is flanked by recombination sites for a sequence specific recombmase m ^ ^^e 
recombination induced between said flanlcing recombination artes results in deletion of 
the said first expression cassette from the genome. 

° Preferably said sequences which allow for specific deletion of said first expression cas- 
sette are selected from the group of sequences consisting of 

a) recombination sites for a sequences-specific racombinase arranged in a way that 
5 recombination between said flanking recombination sites results in deleton of the 
sequences in-between from the genome, and 

h) homology sequences A and A' having a sufficient length and homology in order to 
^ ensure honSiogous recombination between A and A". «nd ^-^^J"? °" 
10 which - upon recombination between A and A" - will result in deletion of the se. 
quences in-between from the genome. 

There are various recombination sites and corresponding sequence specific recombi- 
nases known in the art, which can be employed for the purpose of the invention. 

^ in a preferred embodiment, deletion / exdsion of the dual-martcer sequence |s deleted 
by homologous recombination induced by a sequence-specific double-sfrand break. 
tL basic prindpais are disclosed in WO 03/004659. For this purpose the first expres- 
iron coTsC (encoding for the dual-function martcer) is flanted by homology se- 
30 au^nSs A and A', wherein said homology sequences have suffiaent lengtt. and ho- 
mdo^ln order to ensure homologous recombination between A and A', and hav.ng an 
TrSiSion whid. - upon recombination between A and A' - '^^^^^^^ 
first expression cassette from the genome. Furthermore, the sequence flanked by sa^ 
homotoay sequences further comprises at least one recognition sequence of at least 
-.S ^rbase dS2 fer^e site^iirected induction of DMA double-strand breaks by a se- 
Z^^^^I dU^ double-strand break Indudng enzyme, preferably a seq^c- 
BpS^ DNA-endonudease, more preferably a h°"^ins-f fif'^,^^^^^^ 
an endonudease selected fiom the group consisting of l-Scel. I-Cpal. I-Cpall. I-Crel 
and l-Chul or chimeras thereof with ligand-binding domains. 

*° TK» ^v«resfii«n cassette for the endonudease or recombinase (comprising a se- 
I^^ncSpeS?" r^m^ase or endonudease operably linked to a plant Pro^^^ 
IT^Zdin the DMA constnjd of the Invention. Preferably, said second expression 
Sss'i^ ts 1^^^^ With said first expression cassette flanked by said sequences 

45 vMdn allow for spedfic deletion. 

pre^^reXtion / exdsion of the dual-function martcer dunng a penod where rts ao- 
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tion as a negative selection marker is still required. Preferably induction / activation can 
be iBalized by a method selected from the group consisting of 

a) inducible expression by operably Unking the sequence encoding said recombinase 
5 or endonuclease to an inducible promoter, 

b) inducible activation, by emplo^ng a modified recombinase or endonuclease com- 
prising a Hgand-bindlng-domain, wherein actixnty of said modified recombinase or 
endonuclease can by modified by treatment of a compound having binding activity 

10 to said ligand-binding-domain. 

Further embodiments of the inventions are related to transgenic vectors comprising a 
DNA construct of the invention. Transgenic cells or non-human organisms comprising a 
DMA construct or vector of the Invention. Preferably said cells or non-human organisms 
15 are plant cells or plants, preferably plants which are of agronomical use. 

The present invention enables generation of martcer-free transgenic cells and organ^ 
isms, preferably plants, an accurately predictable manner with high efficiency. 

20 GENERAL DEFINITIONS 

The teachings, methods, sequences etc. employed and described in the international 
patent applications WO 03/004659, WO 04/013333. WO 03/060133 are hereby incor- 
porated by reference. 

To facilitate understanding of the invention, a number of terms are defined below. It is 
to be understood that this invention is not limited to the particular methodology, proto- 
cols, cell lines, plant species or genera, constructs, and reagents described as such. It 
is also to be undeistood that the terminology used herein is for the purpose of describ- 
ing particular embodiments only, and is not intended to limit the scope of the present 
invention which vwill be limited only by the appended claims. It must be noted that as 
used herein and in tfie appended claims, the singular forms "a" and "the" include plural 
reference unless the context cleariy dictates othenwise. Thus, for example, reference to 
"a vector" is a reference to one or more vectors and includes equivalents tiiereof 
known to those sidlied in the art, and so forth. 

The term "about" is used herein to mean approximately, roughly, around, or In the re- 
gion of. When the term "about" is used in conjunction with a numerical range, it modi- 
fies that range by extending the boundaries above and below the numerical values set I 
40 forth. In general, the term "about" is used herein to modify a numerical value above and 
below the stated value by a variance of 20 percerit up or down (higher .or lower). 

As used herein, the word "or" means any one member of a particular list and also in- 
cludes any combination of members of that list. I 

"Agronomically valuable trait" include any phenotype In a plant organism that is useful I 

or advantageous for food production or food products, including plant parts and plant I 

products. Non-food agricultural producte sue* as paper, efc. are also included. A partial I 

list of agronomically valuable traits Includes pest resistance, vigor, development time I 

(time to harvest), enhanced nutrient content, novel growth pattems, flavors or colors. I 
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satt. heat, drought and cold tolerance, and the like. Preferably, agronomlcally valuable 
traits do not Include selectable marker genes (e. g.. genes encoding herbicide or anti- 
biotic resistance used only to facilitate detection or selsction of transformed cells), 
homione biosynthesis genes leading to the production of a plant hormone (e.g., ausiins. 

5 gibberllins, cytokinins, absdsic acid and ethylene that are used only for selection), or 
reporter genes (e-fl. luciferase, glucuronidase, chloramphenicol acetyl transferase 
(CAT etc ). Such agronomically valuable Important traits may Include improvement of 
pest resistance (6.g.. Melchers et al. (2000) Cun Opin Plant Biol 3(2): 147-52). vigor, 
development time (time to harvest), enhanced nutrient content, novel growth patterns. 

10 flavors or colors, salt, heat, drought, and cold tolerance (e.g., Sakamoto et al. (2000) J 
Exp Bot 51(342):81-B; Saijo et al. (2000) Plant J 23(3): 319-327; Yeo et al.(2000) Mol 
Cells 10(3):263-8; Cushman et al. (2000) Gun Opin Plant Biol 3(2):1 17-24), and the 
Uke. Those of skill will recognize that there are numerous polynucleotides from which to 
choose to confer these and other agronomically valuable traits. 

15 

As used herein, the term "amino add sequence" refers to a list of abbreviations, letters, 
charadere or words representing amino add residues. Amino adds may be referred to 
herein by either their commonly known three letter symbols or by the one-letter sym- 
bols recommended by the lUPAC-lUB Biochemical Nomendature Commission. Nu- 

20 deotides, likewise, may be refeired to by their commonly accepted single-letter codes. 
The abbreviations used herein are conventional one letter codes for the ammo acids: 
A. alanine; B. asparagine or aspartic add; C. cysteine; D aspartic add; E. glutamate. 
glutamic add; F. phenylalanine; G. glydne; H histidine; I iadeudne; K. lysine; L. leu- 
dne M methionine; N, asparagine; P. proline; Q. glutamine; R, arginine ; S, senne; T, 

25 threonine: V. valine; W. tryptophan; Y, ^rosine; Z. glutamine or glutamic acid (see L. 
Stryer. Biochemistry. 1988. W. H. Freeman and Company. New York. The letter "x' as 
used herein within an amino add sequence can stand for any amino add residue. 

The terni "nucleotide sequence of interest" refers to any nudeotlde sequence, the ma- 
30 nipulation of which may be deemed desirable for any reason (e.g., confer improved 
qualities), by one of ordinary skill in the art Such nudeotlde sequences indude. but are 
not limited to. coding sequences of stmdural genes (e.g., reporter genes, selechon 
marker genes, oncogenes, drug resistance genes, growth factors, ete.). and non- 
coding regulatory sequences which do not encode an mRNA or protein product. (e.g., 
3S promoter sequence, polyadenylation sequence, tennination sequence, enhancer se- 
quence, etc.). A nudeic add sequence of Interest may preferably encode for an 
agronomically valuable trait. 

The temi "nudeic add" refers to deoxyribonudeotides or rIbonudeoUdes and polymers 
40 or hybrids thereof in either single-or double-stranded, sense or antisense form . Unless 
othenwse indicated, a particular nudeic add_sequence also implicitly encompasses 
consen/atively modified variants thereof (e. g.. degenerate codon sub^tutions) and 
complementary sequences, as well as the sequence e^PlSJ^ '".?'=^5J i. T^?- 
dele acid" is used interchangeably herein wRh "gene", "cDNA, "mRNA". "oligonudeo- 

45 tide," and "polynudeotide". 

The phrase "nucleic add sequence" refers to a single or double-stranded polymer of 
deoxyribonudeotide or ribonudeoKde bases read from the S'- to the 3'-end. It indudes 
chromosomal OUA. self-replicating plasmids. infectious polymers of DMA or RNA and 
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DMA or RNA that performs a primarily structural role. "Nucleic acid sequence" also 
refers to a consecutive list of abbreviations, letters, characters or words,, which repre- 
sent nucleotides. In one embodiment, a nucleic add can be a "probe" which is a rela- 
tively short nucleic add. usually less than 100 nucleotides in length. Often a nudeic 
acid probe is from about 50 nudeotides in length to about 10 nucleotides in length. A 
*target region*' of a nucleic acid is a portion of a nucleic add that is identified to be of 
interest. A "coding region" of a nudeic acid is the portion of the nudeic add which is 
transcribed and translated in a sequence-spedfic manner to produce into a particular 
polypeptide or protein when placed under the control of appropriate regulatory se- 
quences. The coding region Is said to encode sudi a polypeptide or protein. 

A '*polynudeotide constmct" refers to a nudeic acid at least partly created by recombi- 
nant methods. The temn "DNA construct" is referring to a polynucleotide construct con- 
sisting of deoxyribonucleotides. The construct may be single- or - preferably - double 
stranded. The construct may be drcular or linear. 

The skilled worker is ^miliar vtrith a variety of ways to obtain one of a DNA construct 
Constructs can be prepared by means of customary recombination and doning tech- 
niques as are described, for example, In T. Maniatis, E.F. Fritsch and J. Sambrook, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor. NY (19B9), in TJ. Sllhavy, M.L. Berman and LW. Enquist, Experiments 
with Gene Fusions, Cold Spring HariDor Laboratory, Cold Spring Harbor, NY (1984) and 
in Ausubel, F.M. et al., Current Protocols in Molecular Biology. Greene Publishing 



25 

The term ''sense" is understood to mean a nudeic acid having a sequence which is 
homologous or identical to a target sequence, for example a sequence which binds to a 
protein transcription factor and vt^hich is involved in the expression of a given gene. 
According to a preferred embodiment, the nudeic acid comprises a .gene of interest 
30 and elements allowing the expression of the said gene of interest. 

The term "antisense" is understood to mean a nucleic acid having a sequence com- 
plementary to a target sequence, for example a messenger RNA (mRNA) sequence 
the blocking of whose expression is sought to be initiated by hybridization with the tar- 
35 get sequence. 

As used herein, the terms "complementary" or "complementarit/* are used in reference 
to nudeotide sequences related by the base-pairing rules. For example, the sequence 
6'-AGT-3' is complementary to the sequence 5*-ACT-3*. Complementarity can be "par- 

40 tial" or "total" "Partial" complementarity Is vtfhere one or more nudeic acid bases is not 
matched according to the base pairing mles. 'Total" or "complete" complementarity 
between nucleic adds is where each and every nucleic acid base is matched with an- 
other base under the base pairing rules. The degree of complementarity between nu- 
cleic add strands has significant effects on the effidency and strength of hybridization 

45 between nudeic acid strands. A "complement" of a nudeic acid sequence as used 
herein refers to a nucleotide sequence whose nudeic acids show total complementarity 
to the nucleic acids of the nudeic acid sequence. 

The term "genome" or "genomic DNA" is refening to the heritable genetic infomriation of 
SO a host organism. Said genomic DNA comprises the DNA of the nudeus (also refen^ed 
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to as chromosoinal DMA) but also the DNtA of the plastids (e.g., chloroplasts) and other 
cellular organelles (e.g., mitochondria). Preferably the terms genome or genomic DMA 
is referring to the chromosomal DfSiA of the nucleus. 

The temn "chromosomal DMA" or "chromosomal DMA-sequence" is to be understood 
as the genomic DMA of the cellular nucleus independent from the cell <^cle status. 
Chromosomal DMA might therefore be organteed in chromosdmes or chromatids, they 
might be condensed or uncoiled. An insertion into the chromosomal DMA can be dem- 
onstrated and analyzed by various methods known in the ari: like e.g., polymerase 
chain reaction (PCR) analysis, Southern blot analysis, fluorescence in situ hybridization 
(FISH), and in situ PCR. 

■ 

The term "gene" refers to a coding region operably jomed to appropriate regulatory 
sequences capable of regulating the expression of the polypeptide in some manner. A 
gene includes untranslated regulatory regions of DNA (e.g., promoters, enhancers, 
repressors, ete.) preceding (upstream) and following (downstream) the coding regiorl 
(open reading frame, ORF) as well as, where applicable, inten/ening sequences (Le., 
introns) between Individual coding regions (/.a, exons). The tenm ''structural gene" as 
used herein Is Intended to mean a DNA sequence that is transcribed into mRNIA which 
18 then translated into a sequence of amino acids characteristic of a specific polypep- 
tide. 

As used herein the term "coding region" when used in reference to a structural gene 
refers to the nucleotide sequences which encode the amino acids found in the nascent 
polypeptide as a result of translation of a mRNA molecule. The coding region is boun- 
ded, in eukaryotes, on the S'-side by the nucleotide triplet "ATG'^ which encodes the 
initiator niethionine and on the 3'-side by one of the three triplets which specilV stop 
codons (#-e., TAA, TAG, TGA). In addition to containing introns, genomic forms of a 
gene may also include sequences located on both the 5 - and 3'-end of the sequences 
^which are present on the RNA transcript. These sequences are referred to as "flanking" 
sequences or regions (these flanking sequences are located 5' or 3' to the non- 
translated sequences present on the mRNA transcript). The S'-flanking region may 
contain regulatory sequences such as promoters and enhancers which control or influ- 
ence the transcription of the gene. The 3 -flanking region may contain sequences which 
direct the termination of transcription, posttranscriptional cleavage and polyadenylation. 



The term "expression construct" or "expression construct" as used herein is intended to 
mean the combination of any nucleic acid sequence to be expressed in operable link- 
age with a promoter sequence and - optionally - additional elements (like e.g., termi- 
40 nator and/or polyadenylation sequences) which fadlitate expression of said nucleic 
* acid sequence. 

The term "promoter," ''promoter element," or "promoter sequence" as used herein, re* 
fers to a DtslA sequence which when llgated to a nucleotide sequence of interest is ca- 

45 pable of controlling the transcription of the nucleotide sequence of interest into mRNA. 
A promoter is typically, though not necessarily, located 5' (/.e., upstream) of a nucleo- 
tide sequence of interest (e.g., proximal to the transcriptional start site of a structural 
gene) whose transcription into mRNA it controls, and provides a site for specific binding 
by RNA polymerase and other transcription factors for initiation of transcription. A 

5Q polynucleotide sequence is "heterologous to" an organism or a second polynucleotide 
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sequence if it originates from a foreign spedes. or. if ^'^^^^^^^^f-^,"''^ 
fiS from Its original form. For example, a promoter operab ly hnted ^ aheterologo^^ 
coding sequent refers to a coding sequence from a ^^^^^.<^^^'^ ^^"^ ^fj^^ 
which the promoter was derived, or. if from the same speaes. a coding sequence 
which is not naturally assodaled with the promoter (e. g. a S^^^^^l^^'^^^^n 
ing sequence or an allele from a different ecotype or vanety). Suitable promoters can 
be derived from plants or plant pathogens like e.g.. plant viruses. 

Promoters may be tissue specific or cell specific. The term 'tissue specific" as it applies 
to a promoter Jeffers to a promoter that is capable of directing 

nucleotide sequence of interest to a specific type of tissue (e.g.. petals) in ttie relative 
ateenl of S^ression of the same nucleotide sequence of interest in a d-ffe^rt 
of tissue (e g. roots). Tissue specificity of a piomoter may be evaluated by, exam- 
ple, operably tinking a reporter gene to the promoter sequence to generate a ^porter 
constnict introducing the reporter constmct into the genome of a plant such that the 
^poS^r^onsfruct is integrated into every tissue of the resulting ^^nsge"^ pla^^^^ 
d^ecHna the expression of the reporter gene (e.g.. detecting mRNA. protein, or the 
of a prSS^ncoded by the'^^porter gene) in different tissues of the tra^gen^ 
l^Xe detection of a greater level of expression of the reporter gene one or mo^ 
tissues relative to the level of expression of the reporter gene in other tissues shows 
tLi the pm^^^^^^ * specific for the tissues In which greater levels of ^ression a^ 
deeded T^e term "cell type specific" as applied to a promoter refers to a p omoter 
tS!s ' J?a dii^iS seLtive expression of a nucleotide sequence of .rrte^s^ 

r ^ ^^^^ r.=.i^fivii^ ahsence of exDression of the same nucleotide 

^^^cHf TrSere;tT:: di;;^^^^^^^ cel. withm the same «ssu. The temr. '^efl 

^e soedfic" when applied to a promoter also means a promoter capable of promoting 
5SeXe^.^2on of a nucleotide sequence of interest In a region wrthm a single 
feS^ spL^cSy of a promoter may be assessed ■ 

- 21! «i 1 « fti js actiwitv staining (as described for example in Example 7) or immu- 

^^^^^ 

binding detected (e.g.v^ av^*^^^ h ^^^^^^ ^ ^^3^5 

stantially any cell and any tissue. In contras^^ I^ S^SrHnked nuclei acid sequence 
capable of directing a level °*^*^«"^^^^f " '^^"^^^^^ ^gM efc.) whl^ is different " 

^orth^i^Tofir:^^^^^^ — - - - 

sence of the stimulus. 

* * « «^r,e in all tissues of 3 transgenic plant or other organism is 

Where expression of a f "^^"^X!-^^^^^^ ^hich is generally active under most 
desired, one can use a "'^^'^^^^J^^^^Z or cell differentiation (Benfey et al. 

3 S sSHt ^!:i^r S^^e constituUve. Suitable constituf.e promoters for use .n 
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plants include, for example, the cauliflower mosaic virus (CaMV) 3SS transcription Inn 
tiation region (Franck et al. (1980) Cell 21:285-294; Odell et al.(1985) Nature 313:810- 
812; Shewmaker et al. (1985) Wology 140:281-288; Gardner et ai. 1986, Plant Mol. 
Biol. 6, 221-228), the 19S transcription initiation region (US 5,352,605 and 
S WO 84/02913), and region VI promoters, the 1 -or 2'-promoter derived from T-D^4A of 
Agrobacterium tumefeciens, and other promoters active in plant cells that are known to 
those of skill in the art. Other suitable promoters Indude the full-length transcript pro- 
moter from Figwort mosaic virus, actin promoters, histone promotefs, tubulin promot- 
ers, or the mannopine synthase promoter (MAS). Other constitutive plant promoters 

10 include various ubiquitin or polyubiquitin promoters derived from, inter alia, Arabidopsis 
(Sun and Caliis (1997) Plant J 11(5): 1017-1027), the mas, Mac or DoubletWIac promot- 
ers (US 5,106,739; Comal et al. (1990) Plant Mol Biol 15:373-381), the ubiquitin pro- 
moter (Hoitorf S et aK (1995) Plant Mol Biol 29:637-649) and other transcription initia- 
tion regions from various plant genes known to those of skill In the art Useful promot-- 

15 ers for plants also include those obtained from Ti-or Rl-plasmids, from plant cells, plant 
viruses or other organisms whose promoters are found to be functional in plants. Bac- 
terial promoters that function in plants, and thus are suitable for use In the methods of 
the invention include the octopine synthetase promoter, the nopaline synthase pro- 
moter, and the mannopine synthetase promoter. Suitable endogenous plant promoters 

20 include the ribulose-1,6-biphosphate (RUBP) cariDoxylase small subunit (ssu) promoter, 
the a-conglydnin promoter, the phaseolin promoter, the ADH promoter, and heat- 
shock promoters. Further preferred constitutive promoters are the nitrilase promoter 
from Arabidopsis thaliana (WO 03/008596) and the Pisum sativum pbcA promoter (e.g., 
as Incorporated In the constmct described by SEQ ID NO: 15; base pair 6479 - 7341, 

26 complementary orientation). 

Of course, promoters can regulate expression all of fine time in only one or some tis- 
sues. Alternatively, a promoter can regulate expression in all tissues but only at a spe- 
cific developmental time point As noted above, the excision promoter (i. e., the pro- 

30 meter that is linked to the sequence-specific DNA cleaving polynucleotide) is generally 
not constitutive, but instead is active for only part of the life cycle or at least one tissue 
of the transgenic organism. One can use a promoter that directs expression of a gene 
of interest in a specific tissue or is otherwise under more precise environmental or de- 
velopmental control. Examples of environmental conditions that may affect transcription 

35 by inducible promoters include pathogen attack, anaerobic conditions, ethylene or the 
presence of light. Promoters under developmental control include promoter that initi- 
ate transcription only in ceri:ain tissues or organs, such as leaves, roots, fruit, seeds, or 
flowers, or parts thereof. The operation of a promoter may also vaiy depending on Its 
location In the genome. Thus, an inducible promoter may become fully or partially con- 

40 stitutive in certain locations. 

Examples of tissue-specific plant promoters under developmental control include pro- 
moters that initiate transcription only in certain tissues, such as fruit seeds, flowers, 
anthers, ovaries, pollen, the meristem, flowers, leaves, stems, roots and seeds. The 
45 tissue-specific ES promoter fnam tomato is paiticulariy useful for directing gene expres- 
sion so that a desired gene product is located in fnJits. See. e. g.. Lincoln et al. (1988) 
Proc Natl Acad Sci USA 84:2793-2797; OeDcman et al. (1988) EMBO J 7:3315-3320 ; 
Deikman et al. (1992) Plant Physiol 100:2013-2017. Other suitable seed specific pro- 
moters Indude those derived from the following genes: MAC1 from maize (Sheridan et 



SO al. (1 996) Genetics 1 42: 1 009-1 020, Cats from maize (GenBank No. L05934. Ableretal. 
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(1993) Plant Moi Biol 22:10131-1036, the gene encoding oleosin IBkD from maize 
(GenBank No. J05212, Lee et al. (1994) Plant Mol Bfol 26:1981-1987), viviparous-1 
from Arabidopsis (Genbank No. U93215). the gene encoding oleosin from Arabldopsis 
(Genbank No. Z17657), Atmyd from Arabidopsis (Urao et al. (1996) Plant Mai Biol 
32:571-576, the 2s seed storage protein gene family from Arabidopsis (Conceicao et 
al. (1994) Plant 5:493-505) the gene encoding oleosin 20kD from Brasslca napus 
(GenBank No. M639B5), napin from Brassica napus (GenBank No. J02798, Josefsson 
et al. (1987) J, BioL Chem. 262:12196-12201). the napin gene family (e.g., from Bras- 
sica napus ; Sjodahl et al. (1995) Planta 197:264-271, US 5.608.152; Stalberg K, et al. 
(1996) L Planta 199: 515-519), the gene encoding the 2S storage protein from Bras- 
sica napus (Dasgupta et al. (1993) Gene 133: 301-302), the genes encoding oleosin A 
(Genbank No. U09118) and oleosin B (Genbank No. U09119) from soybean, the gene 
encoding low molecular weight sulphur rich protein from soybean (Choi et ai. (1995) 
Mol Gen Genet 246:266-268), the phaseolin gene (US 5,504,200, Bustos MM et al., 
Plant Cell. 19B9;1(g):839-53). the 2S albumin gene (JosefTson LG et aL(1987) J Biol 
Chem 262: 12196-12201), the legumin gene (Shirsat A et al. (1989) Mol Gen Genet 
215(2):326-331), the USP (unknown seed protein) gene (Baumlein H et al. (1991) Mol 
Gen Genetics 225(3):4S9-67), the sucrose binding protein gene (WO 00/26388), the 
legumin B4 gene (LeB4; Baumlein H etal. (1991) Mol Gen Genet 225:121-128; Bae- 
umlein et al. (1992) Plant J 2(2):233-239; Fiedler U et al. (1995) Biotechnology (NY) 
13(10): 1090-1 093), the Ins Arabidopsis oleosin gene (W0984S461). the Brassica Bce4 
gene (WO 91/13980), genes encoding the "high-molecular-weight glutenin*" (HMWG), 
gliadin, branching enzyme, ADP-glucose pyrophosphatase (AGPase) or starch syn- 
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slon in monocots such as maize, barley, wheat, rye, rice and the like. Promoters which 
may advantageously be employed are the promoter of the Ipt2 or Ipti gene 
(WO 95/15389, WO 95/23230) or the promoters described in WD 99/16890 (promoters 
of the hordein gene, the giutelin gene, the oryzin gene, the prolamine gene, the gliadin 
gene, the zein gene, the kaslrin gene or the secalin gene). 

Further suitable promoters are, for example, specific promoters for tubers, storage 
roots or roots such as, for example, the class I patatin promoter (B33). the potato 
cathepsin D Inhibitor promoter, the starch synthase (GBSS1) promoter or the sporamin 
promoter, and fruit-spedfic promoters such as, for example, the tomato fruit-specific 
promoter(EP-A 409 625). 

Promoters which are furthermore suitable are those which ensure leaf-specific expres- 
sion. Promoters which may be mentioned are the potato cytosollc FBPase promoter 
(WO 98/18940), the Rubisco (ribulose-1 ,S-bisphosphate carboxylase) SSU (small sub- 
unit) promoter or the potato ST-LSI promoter (Stockhaus et al. (1989) EMBO J 
8(9):244S-2451). Other preferred promoters are those which govem expression in 
seeds and plant emSryos. 

Further suitable promoters are, for example, fmit-maturation-specific promoters such 
gs. for example, the tomato fruft-maturation-specific promoter (WO 94/21794), flower- 
specific promoters such as, for example, the phyloene synthase promoter 
(WO 92/16635) or the promoter of the P-n- gene (WO 98/22593) or another node- 
specific promoter as described In EP-A 249676 may be used advantageously. The 
promoter may also be a pith-specific promoter, such as the promoter isolated from a 
plant TrpA gene as described in WO 93/07278. A development-regulated promoter is, 
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inter alia, described by Baerson et al- (Baerson SR, l^mppa GK (1993) Plant IMol Biol 
22(2):2S5-67). 

Other preferred promoters are promoters induced by biotic or abiotic stress, such as, 
S for example, the pathogen-lndudble promoter of the PRP1 gene (Ward et al.. Plant Mol 
Biol 1993 22- 361-366), the tomato heat-induable hspSO promoter (US 5,167,267). the 
potato ch'ill-inducible alpha-amylase promoter (WO 96/12814) or the wound-induced 
pinll promoter (EP37S091). 

1 0 Promoter may also encompass further promoters, promoter elements or minimal pro- 
moters capable of modifying the expression-specific characteristics. Thus, for ^mple, 
the tissue-spedfic expression may take place in addition as a function of certain stress 
Actors owing to genetic control sequences. Such elements are, for example, de- 
scribed for water stress, abscislc acid (Lam E and Chua NH (1991) J Biol Chem 

15 26S(26):17131 -171 3S) and heat stress (Schoffl F et al. (1989) Molecular & General 
Genetics 217(2-3):246-S3). 

The tenn "operable linkage" or "operably linked" is to be understood as meaning, for 
example, the sequential arrangement of a regulatory element (e.g. a promoter) with a 
20 nucleic add sequence to be expressed and. if appropriate, further regulatory elements 
(such as e.g.. a terminator) in such a way that each of the regulatory elements can ful- 
fill its intended function to allow, modify, facilitate or othennrise influence expression of 
said nudeic add sequence. The expression may result depending on the an^ngement 
of the nudeic add sequences in relation to sense or antisense RNA. To this end. direct 
25 linkage in the chemical sense is not necessarily required. Genetic control sequences 
such as. for example, enhancer sequences, can also exert their fundion on the target 
sequence fnam positions which are further away, or indeed from other DMA molecules. 
Preferred arrangements are those in which the nudeic add sequence to be expressed 
recombinantly is positioned behind the sequence acting as promoter, so that the two 

30 sequences are linked covalently to each other. The distance bebween the promoter 
sequence and the nudeic add sequence to be expressed recombinantly is preferably 
lesl than 200 base pairs, espedally preferably less than 100 base pairs, very espe- 
cially preferably less than SO base pairs. Operable linkage, and an expression con- 
struct can be generated by means of customary recombination and cloning techniques 

35 as described (e.g.. in IWaniatis 1989; Silhavy 1984; Ausubel 1987; Qelvin 1990). How- 
ever, further sequences which, for example, act as a linker with specific cleavage sites 
for restriction enzymes, or as a signal peptide, may also be positioned between the two 
sequences. The Insertion of sequences may also lead to the expression of fusion pro- 
teins Preferably, the expression constnict. consisting of a linkage of promoter and nu- 

40 deic add sequence to be expressed, can exist in a vector-integrated form and be in- 
serted Into a plant genome, for example by transltomiation. 

The temis "polypeptide", "peptide", "oligopeptide", "polypeptide", "gene product". «ex- 
pressfo^produrf' and "protein" are used interdiangeably herein to refer to a polymer 
45 or oligomer of consecutive amino acid residues. 

Preferably, the term "isolated" when used in relation to a nudeic add. «^ JnJ^" j^^^^^ 
n^elc acid sequence" refers to a nudeic add sequence that is Idenftfied and sepa- 
rated frorS at least one contaminant nudeic add with which it is ordinanly associated m 
SO te nah^al source. Isolated nudeic add is nudeic add present in a form or setting that 
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is dHTerent from that in which it is found in nature. In contrast, non-isolated nucleic ac- 
ids are nucleic acids such as DMA and RNA which are found in the state they exist in 
nature. For example, a given DIsIA sequence (e.g., a gene) is found on the host cell 
chromosome in prowmity to neighboring genes; RNA sequences* such as a specific 
5 mRNA sequence encoding a specific protein, are found in the cell as a mixture with 
numerous other mRNAs which encode a multitude of proteins. However, an isolated 
nucleic add sequence comprising SEQ ID NO:1 includes, by way of example, such 
nucleic add sequences In cells which ordinarily contain SEQ ID NO:1 where the nu- 
cleic add sequence is in a chromosomal or extrachromosomal location different from 

10 that of natural ceils, or is othennnse flanked by a different nucleic acid sequence than 
that found In nature. The isolated nucleic add sequence may be present in single- 
stranded or double-stranded form. When an isolated nucleic acid sequence is to be 
utilized to express a protein, the nudeic add sequence will contain at a minimum at 
least a portion of the sense or coding strand {ie., the nudeic add sequence may be 

15 single-stranded). Altematlvely, it may contain both the sense and anti-sense strands 
(Le., the nucleic add sequence may be double-stranded). 

As used herein, the term "purified" refers to molecules, either nucleic or amino add 
sequences, that are removed from their natural environment, isolated or separated. An 
20 "isolated nucleic add sequence" is therefore a purified nucleic add sequence. "Sub- 
stantially purified" molecules are at least 60% free, preferably at least 75% free, and 
more preferably at least 90% free from other components with which they are naturally 
associated. 

25 The term ^Wild-type", ""natural" or of "natural origin" means with respect to an organism, 
polypeptide, or nucleic add sequence, that said organism is naturally occurring or avai- 
lable in at least one naturally occuning organism which is not changed, mutated, or 
otherwise manipulated by man. 

30 "Transgene", transgenic" or "recombinant* ' refers to an polynucleotide manipulated by 
man or a copy or complement of a polynucleotide manipulated by man. For instance, a 
transgenic expression cassette comprising a promoter operably linked to a second 
polynucleotide may indude a promoter that is heterologous to the second polynudeo- 
tide as the result of manipulation by man (e.g., by methods described in Sambrook et 
35 al., Molecular Cloning-A Laboratory Manual, Cold Spring Harbor Laboratory. Cold 
Spring Harbor, New York, (19B9) or Current Protocols in Molecular Biology Volumes 1- 
3. John Wiley & Sons. Inc. (1994-1998)) of an isolated nucleic acid comprising the ex- 
pression cassette. In another example, a recombinant expression cassette may com- 
prise polynudeotides combined In such a way that the polynucleotides are extremely 
40 unlikely to be found in nature. For instance, restriction sites or plasmid vector se- 
quences manipulated by man may flank or separate the promoter from the second 
polynudeotide. One of skill will recognize that polynudeotides can be manipulated in 
many ways and are not limited to the examples above. 

45 The term "transgenic" or "recombinant" when used in reference to a cell refers to a cell 
which contains a transgene, or whose genome has been altered by the introduction of 
a transgene. The term ^Iransgenic" when used in reference to a tissue or to a plant 
refers to a tissue or plant, respectively, which comprises one or more cells that contain 
a transgene, or whose genome has been altered by the intmduclion of a transgene. 

SO Transgenic cells, tissues and plants may be produced by several methods indudmg the 
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introduclten of a "Iransgene" comprising nucleic add (usually DUA) into a ^[aetc^l or 
inSralion of the tranSene into a chromosome of a target cell by way of human inter- 
vention. such as by the methods described herein. 

5 The term "transgene" as used herein refers to any nucleic acid sequence which is in- 
JSduoSlrrto the genome of a cell by experimental manipulations. A transgene may be 
an "exogenous DNA sequence," or a "heterologous DNA sequence" (/.e.. "fore^n 
DNa" -^'e t^^ "endogenous DNA sequence" refers to a "-'-f ^ --'"^^^^^^ 
is naturally found in the cell Into which it is introduced ^J'^'^^Zt^^l^^^e 

0 some modification (e.g., a point mutation, the presence of a =fjf .^f^^'^^"^^^^^ 

etc.) relative to the natuially-occumng sequence The tenn heterologous D^^ se 
ouence" refers to a nucleotide sequence which is ligated to. or is manipulated to be- 
2^me KgaSd to a nucleic acid sequence to which it is not ligated in nature, or to which 
S^rUgSed at a different location in nature. Hetemlogous DNA is not endogenous to the 

S ^11 So whfch it fe introduced, but has been obtained from another cell. Heterologous 
Dr!/il^^ckides an endogenous DNA sequence which contains some mod,l^cat,on 
G^ei«J aShough not necessarily, heterologous DNA encodes RNA and proteins that 
t^^ncn^vlioduced by the ^ell into which it is expressed. Examples o heterolo- 
□ous LJSv inlde reporter genes, transcriptional and trai^lafional regulatory s^ 

,0 ouences se edable marker proteins {e.g.. proteins which confer dnjg resistance), eta^ 
SrefSv he tern, "transgenitf- or "recombinant" with respect to a regulatory se- 
-ulnS e' a a promoter of the invention) means that sakJ regulatoof sequen^ is 
^^vaTntl^ioin^^^ to a nucleic acid to which it is not adjacent in its natural 

environment. 

The temi 'foreign gene" refers to any nucleic acid (e.g., gene sequence) which is intrcn 
I^c^^to S!^e ge^me of a cell by experimental manipulations and -^^V J^f^e g^e 

seq^n^s foun'd in that cell so long as the '^-^^^^^^^^^^^l X) rSa^e to 
tion (e.g., a point mutation, the presence of a selectable marker gene, efc.} reiauve xo 

30 the naturally-occurring gene. 

« * u. .u^ t^^n •^nsaene- or "transgenic" with respect to. for example, a nucleic 

3S in which either 

a) said nudeic acid sequence, or 

b) a genetic control sequence linked operably to said nudeic add sequence a), for 
40 example a promoter, or 

c) (a) and (b) 



45 



SO 



.s not located in Hs natural a-tic^^ IZ^^^^^^Z^^, 

refers to the natural chromosomal locus of 
a genomic library. In the case ^^^^^t |a^ The environment 

the nudeic add sequence is ^f^^^.'^^^^^l ^nd has a sequence of at least 
flanks the nudeic add sequence at least at one sioe a 
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50 bp, preferably at least 500 bp, especially preferably at least 1 000 bp, very especially 
preferably at least 5000 bp, in length. A naturally occurring expression construct - for 
example the naturally occurring combination of a promoter with the corresponding gene 
- becomes a transgenic expression construct when it is modified by non-natural, syn- 
5 thetic "artificial'' methods such as, for example, mutagenization. Such methods have 
been dasoibed (US 5,565,350; WO 00/15815). 

"Recombinant" polypeptides or proteins refer to polypeptides or proteins produced by 
recombinant DMA tedinlques, i.e., produced from cells transformed by an exogenous 
10 recombinant DMA construct encoding the desired polypeptide or protein. Recombinant 
nucleic acids and polypeptide may also comprise molecules which as such does not 
exist in nature but are modified, changed, mutated or otherwise manipulated by man. 

The term "genetically-modified organism" or "GMO" refers to any organism that com- 
15 prises transgene DNA. Exemplary organisms include plants, animals and microorgan- 
isms. 

The terms "heterologous nucleic acid sequence" or "heterologous DNA" are used inter- 
changeably to refer to a nucleotide sequence' which is ligated to a nucleic add se- 
20 quence to which it is not ligated in nature, or to which it is ligated at a different location 
in nature. Heterologous DNA Is not endogenous to the cell into which It is Introduced, 
but has been obtained from another cell. Generally, although not necessarily, such 
heterologous DNA encodes RNA and proteins that are not normally produced by the 

r-eM intn uihii^h H is eynressed. 

25 „ „ 

The temi "cell" or "plant ceir as used herein refers to a single cell. The term "cells' re- 
fers to a population of cells. The population may be a pure population comprising one 
cell type. Likewwse. the population may comprise more than one cell type. In the pre- 
sent invention, there is no limit on the number of cell types that a cell population may 

30 comprise. The cells may be synchronize or not synchronized. A plant cell within the 
• meaning of this Invention may be isolated (e.g., in suspension culture) or composed in 
a plant tissue, plant organ or plant at any developmental stage. 

The term "organ" vwth respect to a plant (or 'plant organ") means' parts of a plant and 
35 may Include (but shall not limited to) for example roots, fruits, shoots, stem, leaves, 
anthers, sepals, petals, pollen, seeds, etc. 

The term "tissue" with respect to a plant (or "plant tissue") means arrangement of mul- 
tiple plant cells Including differentiated and undifferentiated tissues of plants. Plant tis- 
40 sues may constitute part of a plant organ {eg., the epidennls of a plant leaf) but may 
also constitute tumor tissues (e.g.. callus tissue) and various types of cells In culture 
(e.g.. single ceHs. protoplasts, embryos, calll. protocom>-like bodies, efc). Plant tissue 
may be in plania, In organ culture, tissue culture, or cell culture. 

45 The term "plant" as used herein refers to a plurality of plant cells which are largely dif- 
ferentiated into a stnjcture that is present at any stage of a plant's development. Such 
stnictures include one or more plant organs Including, but are not limited to. fruit, shoot, 
stem, leaf, flower petal, etc. 

50 The term "expression" refers to the biosynthesis of a gene product For example, in the 
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case of a structural gene, expression involves transcription of the structural gene Into 
mRNA and - optionally - the subsequent translation of mRNA into one or more polypep- 
tides. 

S The term "transformation" as used herein refers to the Introduction of genetic material 
(e.g., a transgene) into a cell. Transformation of a cell may be stable or transient. The 
term' "transient transformation" or "transiently transfomned" refers to the introduction of 
one or more transgenes into a cell In the absence of integration of the transgene into 
the host cell's genome. Transient transformation may be detected by, for example, en- 

10 zyme-finked immunosorbent assay (ELiSA) >Mhich detects the presence of a polypep- 
tide encoded by one or more of the transgenes. Alternatively, transient transformation 
may be detected by detecting the activity of the protein (e.g., p-glucuronidase) encoded 
by the transgene (e.g., the uid A gene) as demonstrated herein [e.g., hlstochemical 
assay of GUS enzyme activity by staining with X-gluc which gives a blue predpitate in 

15 the presence of the GUS enzyme; and a chemiluminescent assay of GUS enzyme ac- 
tivity using the GUS-Ught kit (TTcpix)]. The term 'transient transformant" refers to a cell 
which has transiently incorporated one or more transgenes. In contrast, the term "sta- 
ble transformation" or "stably transfomned" refers to the introduction and Integration of 
one or more transgenes Into the genome of a cell, preferably resulting in chromosomal 

20 integration and stable heritability through meiosis. Stable transformation of a cell may 
be detected by Southern blot hybridization of genomic DNA of the cell with nucleic acid 
sequences which are capable of binding to one or more of the transgenes- Alterna- 
tively, stable transfbnnation of a cell, may also be detected by the polymerase chain 
reaction of genomic DNA of the cell to amplify transgene sequences. The term "stable 

25 transformant" refers to a cell which has stably integrated one or more transgenes Into 
the genomic DMA. Thus, a stable transfonmant is distinguished from a transient trans- 
formant in that, whereas genomic DNA from the stable transfomnant contains one or 
mora transgenes, genomic DNA from the transient transfonnant does not contain a 
transgene- Transformation also includes Introduction of genetic material irtto plant cells 

30 in the form of plant viral vectors involving epichromosomai replication and gene ex- 
pression which may exhibit variable properties with respect to meiotic stabUity. 

The terms "Infecting" and "infection" with a bacterium refer to co-incubation of a target 
biological sample, (e.g.. cell, tissue, efc) with the bacterium under conditions such that 
35 nucleic acid sequences contained within the bacterium are Introduced into one or more 
cells of the target biological sample. 

The term "Agmbacterium" refers to a soil-bome, Gram-negative, rod-shaped phytopa- 
thogenlc bacterium which causes crown gall. The term '•Agmbacteriunf' includes, but is 

40 not limited to, the strains Agmbacterium tumefadens, (which typically causes crown 
gall In infected plants), and Agrobacterium rhizogenes (which causes hairy root disease 
in infected host plants). Infection of a plant cell with ^robacterium generally results m 
the production of opines (e.g.. nopaline. agroplne. octoplne efc.) by the infected cell. 
Thus Agrobacterium strains which cause producUon of nopaline (e.g.. strain LBA4301. 

45 CS8 ' A20a) are referred to as "nopallne-type" Agfobat^eriar, Agnbactenum strains 
vtfhich cause production of octopine (e.g-. strain LBA4404. Ach5, BS) are referred to as 
"octopine-type" Agrobacteria: and Agrobacterium strains which cause production of 
agropine (e-g.. strain EHAIOS. EHA1Q1. A281) are referred to as "agropine-type 
Agrobact^a. 

50 
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The terms "bombarding, "bombardmerrt," and "biofistic bombardmerrt" refer to the 
process of accelerating particles towards a target biological sample (e.g., cell, tissue, 
etc.) to effect wounding of the cell membrane of a cell In ttie ta^et biological sample 
and/or entry of the particles Into the target biological sample. Methods for biolistic bom- 
S bardment are known In the art (e.g.. US 5,684.807, the contents of which are herein 
incorporated by reference), and are commercially available (e.g.. the helium gas-driven 
mlcroprojectile accelerator {PDS-1000/He) (BloRad). 

The term "microwounding" when made in reference to plant tissue refers to the intro- 
10 ducHon of microscopic wounds in that tissue. Microwounding may be achieved by, for 
example, particle bombardment as described herein. 

The terms "homology" or "identity" when used In relation to nucleic adds refers to a 
degree of complementariW- Homology or identity between two nucleic acids is under- 
1 5 stood as meaning the identity of the nucleic acid sequence over In each c^e the entire 
length of the sequence, which is calculated by comparison with the aid of the program 
algorithm GAP (Wisconsin Package Version 10.0, University of Wisconsin. Genetics 
Computer Group (QCG), Madison, USA) with the parameters being set as follows: 

20 Gap Weight: 12 Length Weight: 4 

Average Match: 2,91 2 Average Mismatch:-2.0Q3 

For evample, a sequence vinth at least 95% homology (or identity) to the sequence 
25 SEQ ID NO 1 at the nucleic acid level is understood as meaning the sequence which, 
upon comparison with the sequence SEQ ID NO. 1 by the above program algorithm 
with the above parameter set, has at least 95% homology. There may be partial ho- 
mology (Le., partial identity of less then 100%) or complete homology (i.e., complete 
identity of 100%). 

30 

Alternatively, a partially complementary sequence is understood to be one that at least 
partially inhibits a completely complementary sequence from hybndizing to a taig^ 
nudeic acid and Is referred to using the functional term "substantially homologous. 
The inhibition of hybridization of the completely complementary sequence to the target 

35 sequence may be examined using a hybridization assay (Southern or Northern blot, 
solution hybridization and the like) under conditions of low stringency. A substantially 
homologous sequence or probe (i.e., an oligonucleotide which Is capable of hybndiang 
to another oligonucleotide of interest) will compete for and inhibit the binding (i.e., the 
hybridization) of a completely homologous sequence to a target under conditions of low 

40 siingency. This Is not to say that condrtions of low stringen^^ are such Jhat "^^^ 
spedfic binding is permitted; low stringency conditions require that bindmg of two 
sequences to one another be a spedfic (he., selective) interaction^The absence of 
non-specific binding may be tested by the use of a second ta^et which »f °ks ^^f"^ 
S^ial degree of complementarity (e.g., less than about 30% Identity): in the absem^ 

45 of non-specific binding the probe vinll not hybridize to the second non-complementary 
target. 

When used In reference to a double-stranded nucleic add sequence sue*, as a cDWA 
or genomic clone, the term "substantially homologous" relets to any probe wh'ch^f" 
50 ^ibndize to either or both strands of the double^nded nucleic acd sequence under 
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conditions of low stringency as described infra. Wlnen used in reference to a single- 
stranded nudeic add sequence, the term "substantially homologous" refers to any 
probe which can hybridize to the single-stranded nudeic add sequence under condi- 
tions of low stringency as described infra. 

The tenn "hybridization" as used herein indudes "any process by which a strand of 
nudeic add joins with a complementary strand through base pairing." (Coombs 1994). 
Hybridization and the strength of hybridization (/.a, the strength of the association be- 
tween the nudeic adds) is impacted by such factors as the degree of complementanty 
1 0 between the nudeic adds, stringency of the conditions invohred, the Tm of the formed 
hybrid, and the G.C raiBo within the nudeic acids. 

As used herein, the term Tm" is used in reference to the "melting temperature." The 
melting temperature is the temperature at which a population of double-stranded nu- 

15 oleic add molecules becomes half dissodated into single strands. The aquation for 
calculating the Tm of nudeic adds is well known in the ait As indicated by starjdard 
references, a simple estimate of the Tm value may be calculated by the equabon: 
Tm=81 S+0 41(% t3+C) when a nucleic add is in aqueous solution at 1 M NaCl [see 
e g Anderson and Young, Quantitative niter Hybridization, in Nudeic Add Hybridiza- 

20 tion' (laaS)]. Other references indude more sophisticated computations which take 
structural as well as sequence characteristics into account for the calculation of Tm. 

Low stringency conditions when used in reference to nudeic acid hybridization com- 
prise conditions equivalent to binding or hybridization at 6B-C. 5" a solution consisting 

25 6fSx SSPE (43.8 g/L NaCI. 6.9 g/L NaHaPO^-H^O and 1 .85 g/L EDTA P" adjusted to 
7 4 with NaOH) 1% SDS, 5x Denhardf s reagent I50x Denhardfs contains the following 
oer 500 ml^ 5 g Rcoll 0"ype 400. Phamiada). 5 g BSA (Fraction V; Sigma)) and 100 
ug/mL denatured salmon spemi DNA followed by washing In a solution compnsing 
o!2x SSPE. and 0.1% SDS at room temperature when a DNA probe of about 100 to 

30 about 1000 nudeotides in length is employed. 

Hish stringency conditions when used in reference to nudeic acid hybridization com- 
pose conditio^ equivalent to binding or hybridization at 68" a in a sol"t.oncons,s^^ 
of 6x SSPE. 1% SDS. 5x Denhardt's reagent and 100 ^gimL denatured salmon spenn 
DlSfoHow^d by washing in a solution comprising 0.1x SSPE and 0^% SDS at B8 C. 
when a probe of about 100 to about 1000 nudeotides in length is employed. 

The tem> "equivalent" when made in reference to a ^^^'}^^^^'''' ^^"""^^^^^^ 
to a hybridiition condrtion of interest means that the hybndization condrtion and the 
40 hybriSfz^o^conditlon of interest result In hybridization of nudeu: -f ^^-^^^ 
Which have the same range of percent (%) homology. For e^mple. If a hybndization 
- ^dffion of nte^sTiBSUlte in hibridization of a first nudeic acid sequence with other 
TcfeTc add sequences that have from 80% to 90% homology to the first nucleic a^^^ 
s^LtnS^ the^a^^^ hybridization condrtion is said to be equh^lerrt to the Hybnd^a- 
A,. ^T^^^nol^^^ If this other hybridization condition also results in hybndization 
o^^fi^^nCdli^^^^^^^ withV other nudeic add sequences that have from 

80% to 90% homology to the first nudeic aad sequence. 

SS'ri SSr^"^i'^ na.^ iU nUK base C 
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ditions; factors such as the length and nature (DMA. RNA. base coinpositM of^^^^^ 
probe and nature of the target (DisiA, RNA, base compositon. present in solution or 
immobirized. etc.) and the concentration of the salts and other components (e.g., the 
presence or absence of formamide. dextran sulfate. ^^^^^"^/^^^ ^^''^''^ ^^.J^^^ 
5 ared and the hybridization soiution may be varied to generate conditions of erther low 
or high stringency hybridization different from, but equivalent to. the above^isted cona- 
tions Those skilled in the art know that whereas higher stringencies may be preferred 
to reduce or eliminate non-specific binding, lower stringencies may be preferred to de- 
tect a larger number of nucleic acid sequences having different homologies. 

to 

DETAILED DESCRIPTION OF THE INVENTION 

Accordingly, a first embodiment of the invention relates to a method for producing a 
transgenic plant comprising: 

n transforming a plant cell with a first expression cassette comprising a nucleic acid 
^ iTq^nce encod^^^ a D-amino add oxidase operably linked with a promoter allow- 
ing expression in plant cells or plants, in combination with at least one second ©c- 
p^ssion cassette suitaWe for conferring to said p.^nt an agronomically valuable 
20 trait, and 

ii) providing at least one first compound X. which is phytoto«c against P'^"* ^^"f ""J 
Lctionally expressing said D-amino acid oxidase, wherein «a'_?^f°";P^^^^^^^^ 
be metabolized by said D-amIno add oxidase into one or more compound(s) Y 
26 which are non-phytotoxic or less phytotoxlc than compound X, ana 

.iO treating said transfom^ed plant cells of step i) with said first ^^P°""^?JJ"^^^^^^^ 
toxic concentration and selecting plant cells compnsing in their 9eno'^^J^*^f^'^ 
first and said second expression cassette, wherein said first expression cassette is 
cSfeXg resistance to said transformed plant cells against said compound X by 
expression of said D-amino acid oxidase, and 

iv) praviding at least one second compound M, which is non-phytot«dc or rnoderatehr 

^ Ktol against plant cells not fundlonally expressing said ^^f ' 

pnyioioxio iaaa"'o k mMahoiized bv said D-am no acid oxidase into 

wherein said compound M can be metaboHzea oy sain ^. ^ . ^han com- 
one or more oompound(s) N which are phytotoxic or more phytotoxic than com 

pound M, and * 

breaking the combination between said first expression cass^e and said second 
^Lo^^^sion cassette and treating resulting said plant cells wrth said second com- 
Sun^M ?nrconSZtn toxic to plant cells stiH -"^Pf"^ fi^ ^^^^^^^ 
ISssette. and selecting- plant cells comprising said second expression cassette but 
lacking said first expression cassette. 

The temi -combination" or "combined" with respect to the relation between the firet and 
r ^nH^vLression cassette Is to be understood in the broad sense and is intended 
f mLTa,; m^rop^^^^^ is linking the functionality of the two expressjon 

L^ettes The first and the second expression cassette may be compnsed in one DMA 
construct but may also be separate molecules. 



as 



40 



45 



50 



9Q0D tiyVtB Sb jatia 



180" d t?8S: ■ :f dug ^:qi t?002/£0//:i : i tez- ^tdug 

BASF Plant Science GmbH 

25 

Correspondingly the term 'brealang the combination" is to be understood in the broad 
sense to comprise any method which leads to separation of the two expression cas- 
settes. The person skilled in the art is aware of various means to separate sequences 
comprised in a genome, including but not limited to segregation, sequence exdsion or 
5 deletion etc. 

The plant cell to be transformed can be an isolated cell but also part of a plant tissue, 
culture, oiaan or an entire plant at any developmental stage. Furthemiore, the plant cell 
placed under negative selection or counter selection can be isolated cells but also part 
10 of a plant tissue, culture, organ or an entire plant at any developmental stage. Between 
the described steps of the method of the invention additional steps may be comprised. 
For example, between negative selection and counter selection there might be addi- 
tional steps of e.g.. induction of excision and/or plant regeneration. 

15 As mentioned above, the first and the second expression cassette may not be com- 
bined on one DNA construct but may be employed in combination In form of - for ex- 
ample - a co-transformation approach wherein the two separate molecules are trans- 
fonned together into the plant cells. In a scenario like this the combination of the first 
and the second expression cassette can be broken e.g. by segregation (for example 

20 following reproduction of resulting plantlets). In this scenario the multiplicity of resulting 
segregated transgenic plantlets can be easily screened for lack of the first expression 
cassette by employment of compound M. which can be applied, e.g., by spraying. 

However, the first and the second expression cassette may be combined on one DNA 
25 construct. Here the combination can be broken for example by means of sequence 
specific sequence deletion or excision e.g., by employing a sequence-specific recom- 
binase or by induced sequence specific homologous recombination. 

Accorelingly. a second embodiment of the invention relates to a m^hod for pmdudng a 
30 transgenic plant comprising: 

i) transfbmiing a plant cell with a first DIMA construct comprising 



a) a first expression cassette comprising a nucleic acid sequence encoding a D- 
35 amino acid oxidase operably linked with a promoter allowing expression in plant 

cells or plants, wherein said first expression cassette is flanked by sequences 
vwhich allow for specific deletion of said first expression cassette, and 

b) at least one second expression cassette suitable for confen-ing to said plant an 
40 agronomically valuable trait, wherein said second expression cassette is not lo- 
calized between said sequences which aUow for specific deletion of said first ex- 
pression cassette, and — 

ii) providing at least one firet compound X which is phytotoxic agamst plant cells not 
45 functionally expressing saW D-amino acid oxidase, wherein said compound X can 

be metabolized by said D-amino add oxidase into one or more compound(s) Y 
vtfhich are non-phytotoxic or less phytotoxic than compound X. and 

iii) treating said transfonned plant cells of step 1) with said first compound X in a ph^o- 
50 toxic concentration and selecting plant cells comprising in their genome said first 
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DMA construct, conferring resistance to said transformed plant cells against said 
coitipound X by expression of said D-amino add oxidase, and 

iv) providing at least one second compound M. which is non-phytotoxic or modei^eiy 
6 p^^oS^c against plant cells not functionally e^J ^essinfl said ^r^^^^f^!^^^^^.^;^ 
wherein said compound Wl can he metabolized by said D-^'"''? Vf"**- 
one or more compound(s) N which are phytotoxic or more phytotox« than com- 

pound M, and 

10 inducing deletion of said first expression cassette from the geriome of said tran^ 
Srmed plart cells and treating said plant cells with said second compound M .n a 
STcen^tion toxic to plant cells still comprising said first ^^^^'f 
thereby selecting plant cells comprising said second expression cassette but lacking 
^id first expression cassette. 

In a oreferred embodiment the method of the invention further comprises the step of 
Le^^Son of a fert^^^^ The method may further Include sexually or asexually 
propagating or growing off-spring or a descendant of the plant regenerated from said 
plarrt cell. 

This invention discloses the subsequent use of a the mari<er ^^^^^^^^^^n^^^Jit^- 
aminn add oxidase (DAAO. EC 1.4.3.3) for both negative selection and counter- 

range of D-amino acids (Alonso J ef a/, (laats) Microaio,. •"-r^-'^J- 
S-ar^ino acid oxidase constitutes a dual-function matter. The nnark«- has ^en suc- 
eSaWfehed in Arsbidopsis fM/iana. and proven to be versatile. JP'dJ' Y^^J 
SSTg tJJambSuous results, and allowing selection immediately after germmafon (WO 03/ 

060133) 



15 



20 



25 



30 Many prokaryotes and eukaryotes metabolize D-amino acids ^^^^^^.f^^JJ^'^l 
Mo" L^e. Sa\ 57. 1732-174). but current information suggests that D-amino acid me- 
taboKsm is severely restricted in plants. 

However, studies of amino add transporters in plants have shown *f f^^^^' °J.*^f. 
35 prSs may mediate the transport of both L- JJ^^J^j^Tl^g^s^^^^^^ 
*ough the latter usually at lower rates (f^^nj; ^'^^^^^^^^^^^^ 

These findings '"TP'^ ^^^^^^^^^^^^^ routes, one of the most common 

amino adds D-ainino aad^^ ^.^^ ,732^1742). 

40 being oxidative deamination (Piione ^ . _enerally low. with measurable 

45 conditions would not cause negative effects on plants. 

SS™. ^"^1^%%.^"^ ^t^ninV d'Uh.b, and Conine, nore ^ 



10 



35 



880" d ■ J"" :^<^ 8^:91 17002/80/Z.l : i m 

BASF Plant Science GmbH 

27 

rably D-alanine. D-serine. and derivatives thereof. Most preferably. X is comprising 
and/or consisting of D-alanlne, D-Serine. or derivatives thereof. 

Within this Invention it is demonstrated that the toxicity of D-amino acids like e.g., D- 
serine and D-alanlne could be alleviated by the insertion of a gene encoding an en- 
2yme that metabolizes D-amino acids. Wild-type A thaliana were transfomied with the 
dao1 gene from the yeast Rhodotorula gracilis under the control of the constitutive 
promoter CaMV 3SS promoter. Exposure of this transgenic plant to D-alanine or D- 
serine showed that It could detoxify both of these D-amino adds (Fig. 4a.b). 



In another preferred embodiment the second (non-phytotoxlc. but metabolizable into 
phytotoxic) compound M is preferably comprising a D-amino acid structure selected 
from the group consisting of D-isoIeucine. D-valine. D-asparagine. D-leucine D-Iysine, 
D-proline. and D-glutamlne: more preferably D-lsoleucine. D-valine. and denvatives 
16 thereof. Most preferably. M Is comprising and/or consisting of D-lsoleucine. D^aline. or 
derivatives thereof. 



In contrast to D-amIno adds like D-serine and D-alanine. other D-amino acids like D- 
valine and D-lsoleucine, which are not toxic to wild-type Plante. have ^ 

20 influence on the growth of plants expressing DAAO (Fig. 4c.d). The ^^^^^J*'^^^?^ 
expression mrtigated the toxicity of D-serine and D-alan.ne. but '"^du^?^ m^*!^^^ 
changes that made D-isoleudne and D-valine toxic, demonstrate that the eresyme 
could provide a substrate-dependent. dual-funcHon. selectable marker in plants. Selec- 
tion is based on differences in the toxidty of different D-amino adds and their "^etabo- 

25 lites to plants. Thus, D-alanine and D-serine are toxic to plants but are metabolized by 
DAAO into nontoxic products, whereas D-isoleucine and D-valme have low toxicity, but 
are metabolized by DAAO into the toxic keto adds 3-methyl-2-oxopentanoate and 3- 
methyl-2-oxobutanoate. respecUvely. Hence^ botin positive V.!2**rdu?^n*ion 
possible with the same martcer gene, whidi Is therefore considered a dual-function 

30 marker. 

Another subject matter of the Invention relates to DNA constmcts which are suitable for 
^Zng In the method of the invention. A DNA construct suitable for use in the 
method of the invention is preferably compnsing 

a) a first expression cassette comprising a nudeic add sequence ^*=°?^"9 ^^'"^ 
add oxidase operably linked with a promoter allowing expression .n plarrt ceHs or 
p!^nt! wherein said Jrst expression cassette is flanked by sequences which allow 
for spedfic deletion of said first expression cassette, and 



40 



W at least one second expression cassette suitable for confemng to said plarrt an 
^ ag onomlcally valuable Lt. wherein said second expression cassette "o* local- 
i^id between said sequences whid, allow for specific deletion of said first expres- 



45 



sion cassette. 
I. The Dual-Function Marter of the Invention 



The term D-amlno add oxidase (abbreviated DAAO. DAMOX, or DAO) is referring to 
The term P ^'"^ ^™ °^ . J ■ . 2-0x0 add. by - preferably - employing 
the enzyme coverting a D-ammo acia imo a ^ 0x0 ««-iw. »j n \ « «^rftduct 
50 Oxygen (Oa) as a substrate and producing hydrogen peroxide (H^O^ as a coHsrodud 
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(Dixon M & Kleppe K. Biochim. Biophys. Acta 96 (1965) 357-067; Dbcon M & Kleppe K 
Biochim. Biophys. Acta 96 (1d6S) 368-382; Dixon M & Kleppe Biochim. Biophys. Acta 
96 (1965) 383-3B9; Massey V el al. Biochim. Biophys. Acta 48 (1961) 1-9. Meister A & 
Wellner D Ravoprotein amino acid oxidase. In: Boyer. P.D., Lardy, H. and MyrbSck, K. 
5 (Eds.). The £nzymes, 2nd ed-, vol. 7, Academic Press, New York, 1963, p. 609H548.} 

DAAO can be described by the Nomenclature Committee of the international Union of 
Biochemistry and Molecular Biology (lUBMB) with the EC (Enzyme Commission) num- 
ber EC 1.4,3.3. Generally an DAAO enzyme of the EC 1.4.3.3. dass is an FAD fla- 

10 voenzyme that catalyzes the oxidation of neutral and basic D-amino acids into their 
corresponding keto acids. DAAOs have been characterized and sequenced in fungi 
and vertebrates where tiiey ane known to be located in the peroxisomes. The term D- 
amino oxidase further comprises D-aspartate oxidases (EC 1.4.3.1) (DASOX) (Negri A 
et al. (1992) J Biol Chem. 267:11665-1 1871), which are enzymes structurally related to 

15 DAAO catalyzing the same reaction but adh/e only toward dicarboxylic D-amino adds. 
Within this invention DAAO of the EC 1.4.3.3. class are preferred. 

in DAAO, a consen/ed histidlne has been shown (Miyano M et ai. (1991) J Brochem 
109:171-177) to be important for the enzyme's catalytic activity. In a preferred em- 
20 bodiment of the invention a DAAO is referring to a protein comprising the following 
consensus motive: 

CLIVMHUVM^HMNHA]-Y-G«x-[QSAHGSA]-x-6-Xs-G-X-A 

2S wherein amino add residues given in. brackets represent aitemative residues for the 
respective position, x represents any amino add residue, and indices numbers indicate 
the respective number of consecutive amino add residues. The abbreviation for the 
individual amino add residues have their standard lUPAC meaning as defined above. 
A Clustal multiple alignment of the characteristic adive site from various D-amino adds 

30 is shown in Fig. 1. Further potential DAAO enzymes comprising said motif are de- 
scribed in table below; 



Acc.-No. 


Gene Name 


Description 


Source Organism 


Length 
■ 


Qld5S4 


F16E3.7 


Putative D<amino add oxidase (EC 
1 .4.3-3) (DAMOX) (DAO) (DAAO) 


Caenofhabditis elegans 


334 


P24S52 




D*arnino acid oxidase (EC 1.4.3.3) 
(DAMOX) (DAO) (DAAO) 


Fusarium solani (subsp. 
pisO (Nectiia haemato- 
cocca) 


361 


P14a20 


OAO, DAMOX 


D^imino acid oxidase (EC 1.4.3.3) 
(DAMOX) (DAO) (DAAO) 


Homo sapiens (Human) 


347 


P1BBd4 


DAO» DA01 


D-amlno acid oxidase (EC 1 .4.3.3) 
(DAMOX) (DAO) (DAAO) 


Mus musculus (Mouse) 


346 


P00371 i 


DAC 


D-amino add oxidase (EC 1.4.3.3) 
(DAMOX) (DAO) (DAAO) 


Sus scrota (Pig) 


347 


P22942 


DAO 


D-amino add oiddase (EC 1.4.3.3) 
(DAMOX) (DAO) (DAAO) 


Oryctolagus cuniculus 
(Rabbit) 


347 


035076 


DAO 


D-amino acid oxidase (EC 1.4.3.3) 
(DAMOX) (DAO) (DAAO) 


Rattus norvegtcus (Rat) 


346 
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Aee.-No. 


Gena Name 

< 


Description 


Source Organism 


Length 


paa324 


DA01 


D-amino add oxidase (EC 1.4.3.3) 
(DAMOX) (DAO) (DAAO) 


Rhodosporidium tont* 
loldes (Yeast) (Rhodoto- 
mla gracilis) 


368 


usaofiB 


DAO 


□•amino aeici oxidasd (cc 
1.4.3.3) (DAMOX) (DAO) (DAAO) 


Rhodosporidium toru- 
loides, strain TCC 
26217 


36B 


O99042 


DA01 


D-amino acid ojddase (EC 1.4.3.3) 
(DAMOX) (DAO) (DAAO) 


TrigonopdB variabilis 
(Yeast) 


356 


P3122B 


DDO 


D-«spartate oxidase (EC 1.4.3.1) 
(DASOX) (DDO) 


Bos taurus (Bovine) 


341 


089489 


DDO 


D-aspartate oxidase (EC 1.4.3.1) 
(DASO)Q (DDO) 


nuiT*D Scipicsiia ^nuiiMiii^ 






NCiJ06556.1 

1 


(AFa09689) putative Dsamino acid 
oxidase (Hypothetical pro- 
tein) ' 


Neurospora crassa 


362 






Hvoothetical Drotein 


Neurospora crassa 


390 


QBN552 




Similar to D^^spartate oxidase 


Homo sapiens (Human) 


369 


jQ7Z312 


DKFZP6a6FCM27 
2 


Hypothetical protain 
DKF2qp6&6F04272 


Homo sapiens (Human) 


330 


QgVMSO 


CGI 1236 


CG1 1236 protein (GH1254ap) 


Drosophila melanogaster 
(Fni'rtfly) 


341 



001739 



F20H11.5 



O45307 



Q6SZNS 



Q86JV2 



Q9SXG9 



C47A10.5 



CG1233a 



CG1233B 



F20H1 1.5 protein 



I Caenoifiabdftia elegans 



C47A10.5 protein 



Caenorhabditis elegans 



RE734aip 



CG12338 protein (RE4gB60p) 



Similar to Bostaurua (Bovine). D- 
dspartate oxidase (EC 1.4.3.1) (DA- 
SOX) (DDO) 



Drosophila mefanogaaier 
(Fmit fly) 



383 



343 



335 



DrosopHla melanogaster 
(Fmit fly) 



Dictyostefium discoideum 
(Slime mold) 



335 



Y69A2AR.6 



Hypothetical protein 



I Caenodiabditis elegans 



1 



070704 



07PWY6 



AGCGS3627 



AGCGS3442 



Q7PWX4 



AGCG45a72 



Q6PG95 




Q8P4M9 



Q9X7P6 



Qa2MiB 



XAC3721 



AgCP57D9 (Fragment) 



A9CP12432 (Fragment) 



AgCP12797 (Fragment) 



D-amino acid oxidase 



XCCaB7B 

SCO6740, 
SCSF2A.23C 



D-amino acid oxidase 



Putative Domino acid oxidase 



Anoplieles gambiae sir. 
PEST 



344 



Anopheles gambiae sir. 
PEST 



Anophelea gambiae str. 

PEST — 



Xanthomonas axonopo- 
dls (pv. dtfi) 



Xanthomor>Bs campestris 
(pv. campestris) 



Streptorriyces coelicotor 



DAO, SAV1672 



Putative D>amino acid oxidase 



Streptomyces avermitins 



Am 



3SS 



373 



404 



405 



320 



317 
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Acc.-No. 


Gena Name 


Daacrlption 


Source Organism 


Length 


QBVCW7 


DAOl 


D-amino acid oxidase 


Mu5 musculus (Mouse) 


34S 


Q8Z302 




D^sunino acid oxidase 


Crioetiilus griseus (Chi- 
nese hamster) 


346 


Q8Z1M5 


4 
I 




Cavia porcellus (Guinea 


347 


092220 


< 

t 
1 
1 
< 


SiiTifiartD D-aspartate oxidase 


Mu5 musculus (Mouse) 


341 


QaR2R2 




Hypothetical protein 


Mu6 musculus (Mouse) 


341 


P31228 i 


! 


■ 

D-aspartate o>cidase 


B.taunjs 


341 



Tab.1: Suitable D-amino acid oxidases from various organism. Acc.-No. refers to protein se- 
quence from SwisProt database. 



D-Amino acid oxidase (EOnumber 1.4.3.3) can be isolated from various organisms, 
5 including but not limited to pig, human, rat yeast, bacteria or fungi. Example organisms 
are Candida tropicalis, Trigonopsis variabiliSi Neurospora crassa, Chiorella vulgaris, 
and Rhodotorula gradlis. A suitable D-amino acid metabolising polypeptide may be an 
eukaryotic enzyme, for example from a yeast (e.g. Rhodotorula gracilis), fungus, or 
animal or it may be a prokaryotic enzyme, for example, from a bacterium such as Es^ 
1 0 cherichia co/A Examples of suitable polypeptides which metabolise D-amino adds are 
shown in Table 1 and Table 2 . 



'^19SS4 




P24SS2 


Fusarii solani (subsp. pisi) (Nectria haematococca) . 


JX01S2 


Fusarium solani 


P1492a 


Homo sapiens (Human) 


PI 8894 


Mus musculus (mouse) 


P00371 


Sus scrofa (pig) 


P22942 


Oryctolagus cuniculus (Rabbit) 


035078 


Rattus non/egicus (Rat) 


P80324 


Rhodosporidium toruloides (Yeast) (Rhodotomla gracilis) 


QdS042 


Trigonopsis variabilis 


Q9Y7N4 


Schlzosaccharomyces pombe (Fission yeast) SPCC1450 


001739 


Caenorhabditis elegans.F20H1 1.5 


028382 


Sus scrofa (Pig). 


033145 


Mycobacterium leprae 


Q9X7P6 


Streptomyces coeIicolor.SCSF2A23C 


Q9JXF8 


Neisseria meningitidis (serogroup B). 


Q9Z302 


Cricetulus griseus (Chinese hamster) 


Q921M5 


D-AMINO ACID OXIDASE. CavIa parcellus (Guinea pig) 



Tab.2: Suitable D-amino acid oxidases from various organism. Aoc-No. refers to protein se- 
quence from SwisProt database. 

15 

Preferably the D-amino acid oxidase is selected from the enzymes encoded by a nu- 
cleic acid sequence or a corresponding amino add sequences selected from the 
following table 3: 
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GenBanc 
Acc.-No 


Organism 


SEQ ID 


U60066 


Rhodosporidium toniloides (Yeast) 


SEQ ID NO: 1,2 


271657 


Rhodotorula gracilis 




A56901 


Rhodotorula gracilis 




AF003339 


Rhodosporidium toruloides 




AF003340 


Rhodosporidium toruloides 




U53139 


Caenorhabditis eiegans 


SEQ ID NO: 3. 4 


Dooeos 


Nectria haematococca 


SEQ ID NO: 5. 6 


Z50018. 


Trigonopsis vanabilis 


ScQ ID NO: 7, 8 


NC 003421 


Schizosacdiammyces pombe (fission yeast) 


SEQ ID NO: 9, 10 


AL93d129. 


Streptomyces coelicolor A3(2) 


1 SEQ ID NO: 11. 12 


ABQ42032 


Candida boidinii 


1 SEQ ID NO: 13. 14 I 



Tab.3: Suitable D-amino acid oxidases from various organism. Acc.-No- refers to pmtein se- 
quence from GenBank database. 



DAAO is a well-characterized enzyme, and both its crystal structure and its catalytic 
5 mechanism have been detemiined by high-resolution X-ray spectroscopy (Umhau S. et 
Bl. (2000) Proa Natl. Acad. Sd. USA 97, 12463-12466). It is a flavoenzyme located in 
the peroxisome, and its recognized function In animals is detoxification of D-amino ac- 
ids (Pilone MS (2000) CelL MoL Life. Sci. 57, 1732-174). In addition, it enables yeasts 
to use D-amlno adds for growth (Yurimoto H et aL (2000) Yeasf 16, 1217-1227). As 
10 demonstrated above, DAAO from several different species have been characterized 
and shown to differ slightly In substrate affinities (Gabler M et al. (2000) Enzyme Mh 
crob. Techno. 27, 60S-611), but In general they display broad substrate specificity, 
oxidatively deaminating all D-amino acids (except D-glutamate and D-aspartate for EC 
1.4.3.3. caiss DAAO enzymes; Pilone MS (2000) CelL MoL Life. ScL 57, 1732-174). 

15 

DAAO aoti\nty is found in many eukaryotes (Pilone MS (2000) CelL MoL Life, ScL 57, 
1732-174), but there is no report of DAAO activity in plants. The low capacity for D- 
amino acid metabolism in plants has major consequences for the way plants respond 
to D-amlno acids. For instance, the results provided herein demonstrate that growth of 
20 A thaliana In response to D-serine and/or D-alanine Is Inhibited even at quite low con- 
centrations (Fig. 1a,b). On the other hand, some D-amino acids, like D-valine and D- 
isoleucine, have minor effects on plan* growth (Fig. 1c,d) perse, but can be converted 
into toxic metabolites by action of a DAAO. 

25 In an prelterred embodiment D-amino acid oxidase expressed form the DNA-constmct 
of the invention has preferably enzymatic activity against at least one of the amino ac- 
ids selected from the group consisting of D-alanlne, D-serine. D-isoleudne, D-valine. 
and derivatives thereof. Preferably said D-amino acid oxidase ts selected from the 
group of amino add sequen€:es comprising 

30 ^ ^ 

a) the sequences described by SEQ ID NO: 2, 4, 6. B, 1 0. 12, and 14, and 

b) the sequences having a sequence homology of at least 40%, preferably 60%, 
more preferably 80%, most preferably 95% with a sequence as described by SEQ 

35 ID NO: 2, 4, 6, B, 10, 12, and 14, and 
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c) the sequences hybridizing under low or high stringency conditions - preferably 
under high stringency conditions - with a sequence as described by SEQ ID NO: 2. 
4, 6, 8. 10, 12, and 14. 

Suitable D-amino add oxidases also include fragments, mutants, derivatives, variants 
and alleles of the polypeptides exemplified above. Suitable fragments, mutants, deriva- 
tives, variants and alleles are those which retain the functional characteristics of the D- 
amino acid oxidase as defined above. Changes to a sequence, to produce a mutant, 
variant or derivative, may be by one or more of addition, insertion, deletion or substitu- 
tion of one or mora nucleotides in the nudeic add, leading to the addition, Insertion, 
deletion or substitution of one or more amino adds in the encoded polypeptide- Of 
course, changes to the nudeic add that make no difference to the encoded amino add 
sequence are included. 

The D-amino add oxidase of the Invention may be expressed in the cytosol, perox- 
isome or other intracellular compartment of the plant cell. Compartmentalisation of the 
D-amino add metabolising polypeptide may be achieved by fusing the nudeic add 
sequence encoding the DAAO polypeptide to a sequence encoding a transit peptide to 
generate a fusion protein. Gene products expressed without such transit peptides gen- 
erally accumulate in the cytosoL The localisation of expressed DAAO in the perox- 
isome produces HaOa that can be metabolised by the H^Oa degrading enzyme catalase. 
Higher levels of D-amino adds may therefore be required to produce damaging levels 
of H2O2. Expression of DAAO in the cytosol, where levels of catalase activity are lower, 
reduces the amount of D-amino add required to produce damaging levels HaOz. Ex- 
pres7sion of DAAO in the cytosol may be achieved by removing peroxisome targeting 
signals or transit peptides from the encoding nudeic add sequence. For example, the 
daot gene (EC: 1.4.3.3: GenBank Acc.-No.: U60066) from the yeast Rhodotorula 
gracUis (Rhodosporidium toruloides) was doned as described (WO 03/060133). The 
last nine nudeotides encode the signal peptide SKL, which guides the protein to the 
peroxisome sub-cellular organelle. Although no significant differences were observed 
between cytosolic and peroxisomal expressed DAAO, the peroxisomal consfruction 
was found to be marginally more effective than the cytosolic version in respect of inhib- 
iting the germination of the DAAO transgenic plants on 30 mM D-Asn. However, both 
constmcts are inhibited significantly more than the wild-type and may thus be used for 
conditional counter-seledion. 

1.1 The compounds X and M 

The tenn "Compound X" means one or more chemical substances (i.e. one chemical 
compound or a mixture of two or more compound) whidi is phytotoxic against plant 
cells not fundlonally expressing the D-amino add oxidase expressed frorn the first ex- 
pression cassette of the invention, and which can be metabolized by said D-amino acid 
oxidase Into one or more compound(s) Y.which-are non-phytotoxic or less phytotoxic 
than compound X. 

The terni "phytotoxic. "phytotoxldty- or -phytotoxic effed" as used herein is intended to 
mean any measurable, negative effect on the physiology of a plant or plant cell resum- 
ing in symptoms induding (but not limited to) for example reduced or impaired grow*, 
reduced or impaired photosynthesis, reduced or Impaired cell division, reduced or im- 
paired regeneration (e.g.; of a mature plant from a cell culture, callus, or shoot etc.). 
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reduced or impaired fertility etc. Phytotoxicity may further include effects like e.g.. ne- 
crosis or apoptosis. In an preferred embodiment results in an reduction of growth or 
regenerability of at least 50%, preferably at least 60%. more preferably at least 90% in 
comparison with a plant which was not treated with said phytotoxic compound. 

5 

The phytotoxic compound X is metabolized by said D-amino add oxidase into one or 
more compound(s) Y Which are non-phytotoxic or less phytotoxic than compound X. In 
an improved embodiment the toxicity (as for example assessed by one of the physio- 
logical Indicators exemplified above like e.g.. growth or regenerability) of the phytotoxic 

1 0 compound is reduced by the conversion to at least 50%, preferably at least 80%. more 
preferably at least 90% of the original phytotoxidty imposed by compound X. More pre- 
fen-ad this reduction results in an phytotoxic effect on plants (or plant cells) functionally 
expressing said D-amino add oxidase and treated with said compound X in compari- 
son with plants (or plant cells; regardless virtiethar expressing said D-amino add 0x1- 

15 dase or not) not treated with said compound X of not more then 30%. preferably not 
more then 15%, more preferably not more then 10 %, most preferably no statistically 
significant difference in physiology can be observed. 

The tern* 'Compound M" means one or more chemical substances (i.e. one chemical 
20 compound or a mixture of two or more compounds) which is non-phytotoxic or nrioder- 
ately phytotoxic against plant cells not functionally expressing said D-amino acid oxi- 
dase and which can be metabolized by said D-amino add oxidase Into one or more 
compound(s) N which are phytotoxic or more phytotoxic than compound M. 

25 The temi "phytotoxic', -phytotoxicity" or "phytotoxic eifed" has the same definition as 
given above. 

The term "non-phytotoxic' means that no statistically significant difference in physiology 
can be obsen/ed between plant cells or plants (not comprising a funcbonal D-amino 
30 acid oxidase) and the same plant cells or plants treated with compound M or untreated 
plants. 

The term "moderate phytotoxic' means an reduction of an physiological indicator (as 
exemplified above like e.g.. growth or regenerability) for treated plarrt ee ls or plante - 
not comprising a fundional D-amino acid oxidase - in cornpanson with untreated p ants 
or plant cells (reganJIess whether expressing said D-amino aad madase or not ) not 
Lveralbly effecftig growth and/or performance of said treated plants or p ant cells (but 
STe compound in a concentration sufficient to allow for distinguishing and/or 
separating transgenic plants (i,e.. comprising said dual ftindion martcer) from non- 
40 Lnsgeni? planti (i.e.. not comprising said maricer)). Preferably. ^^J^^'^^^^^ 
phy^logical Indicator for said treated plant cells is not more then 30%. preferably not 
more then 15%. more preferably not more then 10 %. 

The phytotoxic compound M is metabolized by said D-amino acid °**dase into one or 
45 more compound(s) N whid, are phytotoxic or more phytotoxic than ^=0"^ ^^^^^ 
improved Lbodiment the toxidty (as for example assessed by one of the phys»olog«cal 
Xtors exemplified above like e.g.. growth or regenerabllrty) of the jrO^POund M .s 
ncSsed in a way that one or more physiological indicator (as ^mplified above ^^^^ 
e.g.. growth or r^enerebllity) are reduced by at least 20%, preferably atjeast^%. 
50 more preferably at least 60%. most preferably at least 90%. The phytotoxic effect of 
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compound N in comparison to compound M is increased by at least 100% (i.e. twice), 
preferably at least 500% (i-^- 5-times), more preferably at least 1000% 0.e. 10 times). 

Various chemical compounds and mixtures thereof can be used as compound X or M. 
The person skilled in the art is aware of assay systems to asses the phytotoxidty of 
these compounds and the capability of a D-amino oxidase to metabolize said com- 
pounds in a way described above leading to decreased or increased phytotoxicity. 

Preferably at least one of the chemical substances comprised in compound X and/or M 
comprises a D-amtno acid structure. 

■ 

As used herein the term a "D-amino acid structure" (such as a "O-leucine structure", a 
"D-phenylalanine structure" or a "D-valine structure") is intended to include the D- 
amino add, as well as analogues, derivatives and mimetics of the D-amino acid that 
maintain the functional activity of the compound (discussed further below). For exam- 
ple, the term "D-phenylalanine structure" is intended to include D-phenylalanine as well 
as D-pyridylalanine and D-homophenylalanine. The term ''D-leucine structure" is in- 
tended to include D-leucine, as well as substitution with D-valine or other natural or 
non-natural amino add having an aliphatic side chain, such as D-norieudne. The term 
"D-valine structure" is intended to include D-valine, as well as substitution with D- 
ieucine or other natural or non-natural amino acid having an aliphatic side chain. 

The D-amIno acid employed may be modified by an amino-terminal or an carboxy- 
termlnal modifying group. The amino-terminal modifying group may be - for example - 
selected from the group consisting of phenylacetyl, diphenytacetyl, triphenylacetyl, bu- 
tanoyl, isobutanoyi hexanoyi, propionyl, 2Miydroxybutanoyl, 4-hydroxybutanoyl, 3- 
hydroxypropionoyi, 2,4-dihydroxybutyroyl, 1-Adamantanecarbonyl, 4~methylvaleryl, 2- 
hydroxyphenylacetyl, 3-hydroxyphenylacetyl, 4-hydrQxyphenylacetyl, 3.S-dihydroxy-2- 
naphthoyl, 3,7-dlhydroxy-2-napthoyl, 2-hydroxycinnamoyl, S-^hydroxycinnamoyl, 4- 
hydroxycinnamoyl, hydrocinnamoyi, 4~formylcinnamoyl, 3-hydroxy-4- 
methoxycinnamoyl, 4-hydroxy-3-methoxycinnamoyli 2-carboxycinnamoyl, 3,4,- 
dihydroxyhydrocinnamoyi, 3,4-dihydroxycInnamcyi, trans-CmnamoyI, (.+-0-n^Si<^c'eIyL 
(.+-.)-fnandelyl-(.+-.)-mandelyl, glycolyl, 3-formylbenzoyl, 4-formylbenzoyl, 2- 
fomiylphenoxyacety I, 8-f ormyH -napthoy I, 4-(hy droxymethyl)ben2oyl, 3- 
hydroxybenzoyi, 4-hydroxybenzQyI, S-hydantoinacetyl, L-hydroorotyl, 2,4- 
dihydroxybenzoyl» 3-benzoylpropanoyl, (.+-.)-2,4-<lihydroxy-3,3-dimethylbutanoyl, DL- 
3-(4-hydroxyphenyl)lactyl, 3-(2-hydroxyphenyl)propionyl, 4-(2-hydroxyphenyl)propionyl, 
D-3-phenyllactyl, 3-(4-hydroxyphenyl)proplonyl, L-3-phenyllactyl. 3-pyridyIacetyI, 4- 
pyridylacetyl, isonicotinoyi, 4-quinolinecarboxyl, 1-isoqulnolinecarboxyI and 3- 
isoquinoBnecarboxyl The carboxy-tenrnlnal modifying group may be - for example - 
selected from the group consisting of an amide group, an alkyl amide group, an aryl 
amide group and a hydroxy group. 

The terms "analogue", 'derivative" and "mimetic" as used herein are intended to in- 
clude molecules which mimic the chemical sfructure of a D-amlno acid structure and 
retain the functional properties of the D-amino acid structure. Approaches to designing 
amino acid or peptide analogs, derivatives and mimetics are known in the art. For ex- 
ample, see Farmer, P. S. In Drug Design {E. J, Ariens, ed.) Academic Press, New 
Yoric, 1980, vol. 10. pp. 119-143; Ball. J, B. and Alewood, P. F. (1990) J. Mol. Recogni- 
tion 3:55; Morgan, B. A. and Galnor, J. A. (19B9) Ann. Rep. Med. Chem. 24:243; and 
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Freidinger, R. M. (1989) Trends Pharmacol. Sci. 10:270. See also Sawyer, T. K. (1995) 
"Peptidomimetic Design and Chemical Approaches to Peptide Metabolism" in Taylor, 
M. D. and Amidon, G. L (edB*) Peptide-Based Drug Design: Controlling Transport and 
Metabolism, Chapter 17; Smith, A. B. 3rd. et al. (1995) J. Am. Chem. Soc 117:11113- 
11123; Smith, A. B. 3rd, et al. (1994) J, Am. Chem, Soc- 116:9947-9962; and Hirsch- 
man. R.. etaL (1993) J. Am. Chem. Soc. 115:12550-12568. 

As used herein, a ''derivative" of a compound X or M (e.g., a D- amino acid) refers to a 
form of X or M in which one or more reaction groups on the compound have been deri- 
vatized with a substituent group. Examples of peptide derivatives include peptides in 
which an amino acid side chain, or the amino- or carfaoxy-terminus has been derivat- 
ized. As used herein an "analogue*' of a compound X or M refers to a compound which 
retains chemical structures of X or M necessary for functional activity of X or M yet 
which also contains certain chemical structures which differ from X or M, respectively. 
As used herein, a "mimetic" of a compound X or M refers to a compound in which 
chemical structures of X or M necessary for functional activity of X or M have been re- 
placed with other chemical structures which mimic the confomnation of X or M, respec- 
threly. 

Analogues are intended to include compounds in which one or more D-amino acids are 
substituted with a homologous amino acid such that the properties of the original com- 
pound are maintained. Preferably conservative amino acid substitutions are made at 
one or more amino acid residues. A "conservative amino acid substitution" is one in 
which the amino acid residue is replaced with an amino acid residue having a similar 
side chain. Families of amino acid residues having similar side chains have been de- 
fined in the art, including basic side chains (e.g., lysine, arginine, histidine), acidic side 
chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, 
asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g./ alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, trypto- 
phan), (5-branched side chains (e.g., threonine, valine^ isoleucine) and aromatic side 
chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Non-limiting examples of 
homologous substitutions that can be made include substitution of D-phenylalanine 
with D-tyrosine, D-pyridylalanine or D-homophenylalanine, substitution of D-feudne 
with D-valine or other natural or non-natural amino acid having an aliphatic side chain 
and/or substitution of D-vallne with D-leudne or other natural or non-natural amino add 
having an aliphatic side chain. 

Other possible modifications include N-alkyI (or aryl) substitutions, or backbone 
crosslinking to construct lactems and other cyclic structures. Other derivatives include 
C-tenminal hydroxymethyl derivatives, O-modified derivatives (e.g.. C-termlnai hy- 
droxymethyl benzyl ether), N-terminally modified derivatives including substituted am- 
ides such as alkylamides and hydrazides. 

In certain embodiments the D-amlno acid structure is coupled directly or indirectly to at 
least one modifying group (abbreviated as MG). The term "modifying group" is Intended 
to include structures that are directly attached to the D-amino acid structure (e.g., by 
covalent coupling), as well as those that are indirectly attached (e.g., by a stable non- 
covalent association or by covalent coupling to additional amino acid residues). For 
example, the modifying group can be coupled to the amino-terminus or carboxy- 
terminus of a D-amlno acid structure. Modifying groups covalently coupled to the D- 
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amino add structure can be attached by means and using methods well known in the 
art for linking chemical structures, including, for example, amide, alkylamino, car- 
bamate, urea or ester bonds. In a preferred embodiment the modifying group(s) com- 
prises a cydic, heterocyclic, polycyclic or branched alkyi group. 

5 

No endogenous D-amino acid oxidase activity has been reported in plants. Compound 
X or M, respectively, as substrates for the D-amino acid oxidase may be a D-amIno 
add structure comprising the structure of D-arginine, D-glutamate, D-alanine, I> 
aspartate, D-cysteine, D-glutamine, D-histidine, D-isoleucine, D-leucine. D-lysine, D- 

10 methionine, D-asparagine, D-phanylalanine, D-proline, D^erine, D-threonlne, D- 
tryptophane, D-tyrpsine or D-vallne. Preferably compound X and M Is comprising D- 
arginine, D-glutamate, D-alanine, D-aspartate, D-cysteine, D-glutamine, D-histidine. D- 
isoleudne, D-leudne, D-lysine, D-methionine, D-asparagine, D-phenylalanine, 
proline. D-serine, D-threonine, D-tryptophane, D-tyrosine or D-valine. Other suitable 

15 substrates for D-amino add metabolising enzymes include non-protein dextrorotatory 
amino adds, precursors of dextrorotatory amino adds and dextrorotatory amino acid 
derivatives. Suitable precursors indude D-omithine and D^itrulline. 

The fact that compound X and M preferably comprise a D-amino add structure does 
20 not rule out the presence of L-amino add structures or L-amino acids. For some appli- 
cations it may be preferred (e.g., for cost reasons) to apply a racemic mixture of D- and 
L-amino adds (or a mixture with enriched content of D-amIno adds). Preferably, the 
rafio of the D-amino add to the corresponding L-enantiomer is at least 1:1, preferably 
2:1, more preferably S:1, most preferably 10:1 or 100:1. 

26 

The preferred compound may be used in isolated form or in combination with other 
substances. For the purpose of application, the compound X or M are advantageously 
used together with the adjuvants conventionally employed in the art of formulation, and 
are therefore formulated in known manner, e.g. Into emulsifiable concentrates, coatable 
30 pastes, diredly sprayable or dilutable solutions, dilute emulsions, wetlable powders, 
soluble powders, dusts, granulates, and also encapsulations In e.g. polymer sub- 
stances. As with the nature of the compositions to be used, the methods of application, 
such as spraying, atomising, dusting, scattering, coating or pouring, are chosen In ac- 
cordance with the intended objectives and the prevailing drcumstances. 

35 . 
The formulations, i.e. the compositions, preparations or mbctures containing compound 

X or M (active ingredient), and, where appropriate, a solid or liquid adjuvant, are pre- 
pared in known manner, e.g. by homogeneously mbcing and/or grinding the active In- 
gredients with extenders, e.g. solvents, solid carriers and, where appropriate, surface- 
40 active compounds (surfactants). 

Suitable solvents are: aromatic hydrocarbons, preferably tlie fractions confining 8 to 
12 carbon atoms, e.g. xylene mixtures or substituted naphthalenes, phthalates such as 
dibutyl phthalate or dioctyl phthalate, aliphatic hydrocarbons such as cyclohexane or 
45 paraffins, alcohols and glycols and their ethers and esters, such as ethanol, ethylene 
glycol, ethylene glycol monomethyl or monoethyl ether, ketones such as cyclohexa- 
none, strongly polar solvents such as N-methyl-2-pyrroiidone. dimethyl sulfoxide or 
dimethylformamide. as well as vegetable oils or epoxidised vegetable oils, such as ep- 
oxidi&ed coconut oil or soybean oil; or - preferably - water. 
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The solid carriers used e.g. for dusts and dispersible powders are normally natural 
mineral fillers sudi as calcite, talcum, kaolin, montmorillonite or attapulgite. In order to 
improve the physical properties it Is also possible to add highly dispersed silicic acid or 
highly dispersed absorbent polymers. Suitable granulated adsorptive carriers are po- 
5 rous types, for eicample pumice, broken brick, sepiolite or bentonite; and suitable non- 
sorbent carriers are. for example, calcite or sand. In addition, a great number of pre- 
granulated materials of inorganic or organic nature can be used. e.g. especially dolo- 
mite or pulverised plant residues. 

10 Depending on the nature of the compound X or M to be fonnulated suitable suriace- 
active compounds are nonionic, cationic and/or anionic surfactants having good 
emulsifying, dispersing and wetting properties. The term "surfactants" will also be 
understood as comprising mbetures of surfactants. 

15 Both so-called water-soluble soaps and also water-soluble synthetic surface-active 
compounds are suitable anionic surfactants. Suitable soaps are the alkali metal salts, 
alkaline earth metal salts or unsubstituted or substituted ammonium sahs of higher fatty 
acids (Cio -C22). e.g. the sodium or potassium salts of oleic or stearic acid or of natural 
fatty add mixtures which can be obtained e.g. from coconut oil or tallow oil. Fatty acid 

20 methyttaurin salts may also be mentioned as surfactants. 

More frequently, however, so-called synthetic surfactants are used, especially fatty 
sulfonates, fatty sulfates, sulfonated benzimidazole derivatives or alkylarylsulfonates. 

25 The fatly sulfonates or sulfates are usually in the fomn of alkali metal salts, alkaline 
earth metal salts or unsubstituted or substituted ammonium salts and contain a C.sub.8 
-C-sub.22 alkyl radical which also includes the alkyi moiety of acyl radicals, e.g. the 
sodium or calcium salt of lignosulfonic acid, of dodecylsulfate or of a mixture of fatty 
alcohol sulfates obtained from natural fatty acids. These compounds also comprise the 

30 salts of sulfated and sulfonated fatty alcohol/ethylene oxide adducts. The sulfonated 
benzimidazole derivatives preferably contain 2 sulfonic acid groups and one fatty acid 
radical containing 8 to 22 carbon atoms. Examples of alkylarylsulfonates are the so- 
dium, calcium or triethanolamine salts of dodecylbenzenesulfonic acid, 
dibutylnaphthalenesulfonic add, or of a condensate of naphthalenesulfonic acid and 

35 formaldehyde. Also suitable are corresponding phosphates, e.g. salts of the phosphoric 
acid ester of an adduct of p-nonylphenol with 4 to 14 moles of ethylene oxide, or 
phospholipids. 

IMon-ionic surfactants are preferably polyglycol ether derivatives of aliphatic or 
cydoaliphatic alcohols, saturated or ur^saturated fatty acids and alkylphenols, said de- 

40 rivatives contains 3 to 30 glycol ether groups and 8 to 20 carbon atoms in the (ali- 
phatic) hydrocarbon moiety and 6 to 18 carbon atomsjn the allcyl moiety of the alkyl- 
phenols. Further suitable non-Ionic surfactants are the water-soluble adducts of poly- 
ethylene oxide with polypropylene glycol, ethylenediaminopolypropylene glycol and 
alkylpo'yPfopy^®"® S'ycol containing 1 to 10 carbon atoms In the alkyl chain, which ad- 

45 ducts contain 20 to 250 ethylene glycol ether groups and 10 to 100 propylene glycol 
ether groups. These compounds usually contain 1 to 5 ethylene glycol units per propyl- 
ene glycol unit. Representative examples of nonnonic surfactants are nonylphenolpoiy- 
ethoxyethanols, castor oil polyglycol ethers, polypropylene/polyethylene oxide adducts, 
tributylphenoxy-pQiyethoxyethanoI. polyethylene glycol and octylphenoxy- 
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polyethoxyettianol. Fatty acid esters of polyoxyethylene sorbitan, e.g. polyoxyethylene 
sorbitan trioleate, are also suitable. 

Cationic surfactants are preferably quaternary ammonium salts whic*i contain, as N- 
5 substituent, at least one Ga -C22 alkyi radical and, as further substituents, unsubstituted 
or haloflenated lower alkyI, benzyl or hydroxy-tewer alkyI radicals. The salts are pref- 
erably In the form of halides, methylsulfates or ethylsuH^tes, e.g. stearyltrimethyl- 
ammonium diloride or ber\zyldi(2-ch!oroethyl)ethylammonium bromide. 

10 The surfactants customarily employed in the art of fbrmulation are described e.g. in the 
following publications: "McCutcheon's Detergents and Emulsifiers Annual" MC Publish- 
ing Corp.. RIdgewood. N.J., 1981. Stache. H., "Tensid-Taschenbuch", Cart Hanser 
Verlag MunicJWienna 19B1. 

15 The compositions usually contain 0.1 to 99% by weight, preferably 0.1 to 96% by 
weight, of a compound X or M. 1 to 99.9% by weight, preferably 5 to 99.8% by weight, 
of a solid or liquid adjuvant and 0 to 25% by weight, preferably 0.1 to 25% by weight, of 
a surfactant. 

20 The compositions may also contain further ingredients such as stabilizers, antifoams, 
viscosity regulators, binders, tackifiers as well as fertilizefs or other active ingredients 
for obtaining special effects. 

Various methods and techniques are suitable for employing compound X or M or com- 
25 positions containing them for treating plant cells or plants. Such method may Include 

i) Incorporation Into liquid or solidified media or substrates utilized during transforma- 
tion, regeneration or growth of plant cells, plant material or plants. 

30 fi) Seed dressing 

iii) Application by spraying (e.g. from a tank miscture utilizing a liquid fonnulation) 

1.1.1 Compound X 

35 Preferably compound X Is comprising a substance comprising a stmcture selected from 
the group of consisting of D-tryptophane, D-histidine. D-arginine, D-threonine, D- 
methionine D-serine, and D-alanine. more preferably a structure selected from the 
group consisting of D-serine, and D-alanine. Most preferably compound X is compns- 
ing a substance comprising the structure of D-alanine. 
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Preferably compound X is comprising a substance selected from the group of consist- 
ing of D-tryptophane, D-histidine, D-arglnine. DHhreonine, D-methionine. D-senne. and 
D-alanine. more preferably selected from the group consisting of D-senne. and D- 
alanine. Most preferably compound X is comprising D-alanine. 

The use of D-alanine has the advantage that racemic mixtures of D- and L^lanine can 
be applied without disturbing or detrimental effects of the L-enantiomer. Therefore, in 
an improved embodiment an racemic mixture of D/L-aianine Is employed as compound 
X. 
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Furthermore. D-amino acid structure comprising herbicidal compounds may be em- 
ployed as compound X. Such compounds are for example described in US 5,059,239. 
and may include (but shall not be limited to) N-ben2oyl-N-(3-chloro-4-fluorophenyI)-DL- 
alanine, N-benzoyl4>l-(3Hc:hloro-4-fluorophenyl)-DL-alanine methyl ester, N-benzoyl-N- 
5 (3-chloro-4-fluorophenyl)-DL-alanine ethyl ester, N-benzoyl-N-(3-chloro-4- 
fluorophenyl)-i>a/an/nej N-'benzoyl*N'(3-chlorD^fluorophenyl)-D-a/dn/i7e methyl ester, 
or N-bBn2oyl-N-(3-chloro-4-fluorophenyI)-D-a/ami7e isopropyl ester. 

When applied via the cell culture medium (e.g.. incorporated into agar-soiidified MS 
10 media plates), D-alanine can be employed in concentrations of about 0.1 mM to about 
100 mlWI. preferably about 0.3 mM to about 30 mM, more preferably about 1 mM to 
about 5 mM. 

\A/hen applied ^ria the cell culture medium (e.g.. incorporated into agar-solidrfied MS 
15 media plates), D-serine can be employed in concentrations of about 0.1 to about 10 
mM, preferably about 0.3 to 4 mM, more preferably about 0.5 mM to about 1 .5 mM. 

1.1.2 Compound M 

Preferably compound M is comprising a substance comprising a structure selected 
20 from the group of consisting of D-isoleuc!ne, D-valine, D-asparagine. D-leucine, D- 
lysine, D-proline, and D-glutamine, more preferably a structure selected from the group 
consisting of D-isoleucine, and D-valine. Most preferably compound M is comprising a 
substance comprising the structure of D-isoleucine. 

25 Preferably compound M is comprising a substance selected from the group of consist- 
ing of D-!soIeudne, D-valine, D-asparagine, D-leucine. D-lysine, D-prollne, and D- 
glutamine, more preferably selected from the group consisting of D-isoleucine, and D- 
valine. Most preferably compound M is comprising D-isoleudne. 

30 When applied via the cell culture medium (e.g., incorporated into agar-soiidified MS 
media plates), D-isoleucine can be employed in concentrations of about 0.1 mM to 
about 100 mM, preferably about 1 mM to about 50 mM. more preferably about 10 mM 
to about 30 mM. 

35 When applied via the cell culture medium (e.g., incorporated into agar-soiidified MS 
media plates), D-valine can be employed in concentrations of about 1 to about 100 
mM, preferably about 5 to 50 mM, more preferably about 15 mM to about 30 mM. 

When applied via the cell culture medium (e.g., incorporated into agar-soiidified MS 
40 media plates), D-asparagine or D-glutamine can be employed in concentrations of 
about 0.5 to about 100 mM, preferably about 1 to SO mM, more preferably about 3 mM 
to about 20 mM. ~" 

1.1.3 Mode of Application 

45 As described above, the selection can be done during any step of plant cell culture, 
regeneration or plant growth. Surprisingly, the D-amino add compounds are able to 
exhibit their growth modulating properties not only during cell culture (e.g., when ap- 
plied on isolated plant cells, shoots or plantlets) but also later when applied on plants 
via spraying. When applied via spraying, D-alanine may be applied In concentrations of 
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about S to about 100 mM. preferably from about 10 to about 80 mM. more preferably 
from about 40 to about 60 mM. When applied via spraying. D^enne may be appl'sd «" 
concentrations of about S to about 80 mM. preferably from about 10 to about 60 mM. 
more preferably from about 20 to about 40 mM. 

5 

II. The Marker Excision Feature of the invention 

It is one essential feature of the Invention that the dual-function marker of the invention 
is soecifioally deleted after its use. Preferably, deletion of the first expression cassette 
encoding for said dual-funcUon marker can be realized by various means known in the 
1 0 art. including but not limited to one or more of the followng methods: 

a) recombination induced by a sequence specific recomblnase. wherein said first ex- 
oresslon cassette is flanked by corresponding recombination srtes in a v«y that re- 
combination between said flanking recombination sites results in deletion of the se- 
1 5 quences In-between from the genome {for specific embodiments see IIL1 below). 

W homologous recombination between homology sequences A and A' flanking said 
first expression cassette, preferably induced by a sequence-specific double-strand 
break between said homology sequences caused by a sequence specific endonu- 
20 clease. wherein said homology sequences A and A' have ^"^'"^"Vf "1^°" 
mology in onJer to ensure homologous recombination between A and A and having 
an orientation which - upon recombination between A and A' - will lead to e«.s^ 
of said firet expression cassette from the genome of said plant (for specific embodi- 

mm-^Aw**a cAc:^ III 9 KelnWV.. 

^ Accordingly, for ensuring marker deletion / excision the expression ^^^^f^^l^l^' 
amino add oxidase (the first expression constmct) compnsed in the DMA construct of 
the invention is flanked by sequences which allow for specific deletion of sa.d expres- 
sion cassette. Said sequences may be recombination sites for a sequence specific 

30 re«rSfnai which are placed in a way the recombination induced b-^^" ^^^"^^^ 
Ing recombination sites results in deletion of the said first ^'^'^'^ ^^^^^^^^^^ 
o^ome There are various recombination sites and corresponding sequence specific 
?e^Xas^ known in the art (described herein below), which can be employed for 
the purpose of the invention. 



45 



In another preferred embodiment, deletion / excision of the dual-maiker sequence is 
perfoLd fy intramolecular (preferably intrachromosomal) ^^^^l^^^Z'ST^l 
Homologous recombination may occur spontaneous but is preferably induced by a s^ 
quTno^speclfic double-strand break (e.g., between the ^^"^"^^^l^^^^^^^^^ 
basic Principals are disclosed In WO 03/004659. For this purpose the first expression 
c^nSrSbTrencoding for the dual-function maricer) is flanked by homology sequences A 
^d^thl^S:^dZmo\cQy sequences have sufficient length and^.omology in order 
to ensure ho^^^ogous recombination between A and A. and having an onentation 
l:ru%reUbinatlon --eri A^^^^^^^ 

:^rnr^rr rp^eT^t^Ltt^r^^^^ of 10 base 

Sir^ fo?^e ste-dired^ induction of DMA double-strand breaks by a ^P^" 
df^TDNA double-strand break inducing enzyme, preferably a ^^^^^^^^^f""^"^ 
endonudease. more preferably a homing-endonudease, most preferably a endonude- 
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ase selected from the group consisting of i-Scel, l-Cpal, l-Cpali, l-^Crel and l-Chul or 
chimeras thereof with ligand-binding domains. Suitable endonuclease are described 
herein below. 

5 Recombination Sites and Recomblnases of tha invention 

Sequence specific recombinases and their corresponding recombination sites suitable 
within the present Invention may include but are not limited to the Cre/lox system of 
the bacteriophage PI (Dale EC and Ow DW (1991) Proc Natl Acad Sci USA BB:1055S- 
10562; Russell SH et ai. (1992) Mol Gene Genet 234: 49-59; Osborne Bl etal. (1995) 

10 Plant J. 7, 687-701), the yeast FLP/FRT system (Kilby NJ et al. (199S) Plant J 8:637- 
652; Lyznik U< et el. (1996) Nucleic Acids Res 24:3784-3769), the lUlu phage Gin re- 
combinase, the E. coli Pin recombinase or the R/RS system of the plasmid pSRI 
(Onouchi H et al.(1995) Mol Gen Genet 247:653-660; Sugita Ket al. (2000) Plant J. 
22:461-469). The recombinase (for example Cre or FLP) interacts specifically with its 

15 con'esponding recombination sequences (34 bp iox sequence and 47 bp FRT se- 
quence, respectively) in order to delete or invert the interposed sequences. Deletion of 
standard selection marker in plants which was flanked by two Iox sequences by the Cre 
is described (Dale EC and Ow DW (1991) Proc Natl Acad Sci USA 86:10558-10562). 
The preferred recombination sites for suitable recombinases am described in Table 4 

20 below: 



Recomfai- 
naae 


Organism 
of origin 


Recombination Sites 


CRE 


Bacteriophage P1 


S*-AACTCTCATC(3CTTCGGATAACTTCCTGTTATCCGAAA 

CATATCACTCACTTTGQTSATTTCACCQTAACTeTC- 

TATGATTAATG-3* 


FLP 


Saccharomyces 
cerevisiae 


5*-GAAGTTCCTATrCCGAAGTTCCTATTCTCTAGAA AG- 
TATAGGAACTTC-S' 


R 


pSR1 
Plasmids 


S'-CGAGATCATATCACTGTGGACGTTGATGAAAGAATAC 
GTTAl ICII iCATCAAATCGT 



Tab 4: Suitable sequence specific recombinases for use in the method of the invention. 



111.2 The Homology Sequences 

2S Referring to the homology sequences (e.g., A. A') "sufficient length" preferably refers to 
sequences with a length of at least 20 base pairs, preferably at least 50 i»se pairs, 
especially preferably at least 100 base pairs, very especially preferably at least 250 
base pairs, most preferably at least 500 base pairs. 

30 Referring to the homology sequences (e.g.. A, A*), 'sufficient homology" preferably re- 
_ fers to sequences with at least 70%, preferably 80%, by preference at least 90%, es- 
pecially preferably at least 95%, very especially preferably at least 99%, most prefera- 
bly 1 00%. homology within these homology sequences over a length of at least 20 
base pairs, preferably at least 50 base pairs, especially preferably at least 100 base 
35 pairs, very especially preferably at least 250 base pairs, most preferably at least 500 
base pairs. 

The homology sequences A and A' are preferably organized in the form of a direct re- 
peat The term "direct repeaf means a subsequent localization of two sequences on 
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the same strand of a DMA molecule in the same orientation, wherein these two se- 
quences fulfill the above given requirements for homologous recombination between 
said two sequences. 

S In a preferred embodiment, the homology sequences may be a duplication of a se- 
quence having additional use within the DMA constnjct. For example, the homology 
sequences may be tMO transcription tenninator sequences. One of these terminator 
sequences may be operably linked to the agronomlcally valuable trait, while the other 
may be linked to the dual-function selection mailcer, which is localized In 3'-directlon of 

10 the trait gene. Recombination between the two terminator sequences will excise the 
marker gene but will reconstitute the tenninator of the trait gene. In another example, 
the homology sequences may be two promoter sequences. One of these promoter 
sequences may be operably linked to the agronomically valuable trait, while the other 
may be linked to the dual-function selection marker, which Is localized in 5*-direction of 

15 the trait gene. Recombination between the two promoter sequences will ©»ase the 
marker gene but will reconstitute the promoter of the trait gene. The person skilled in 
the art will know that the homology sequences do not need to be restricted to a single 
ftincttonal element (e-g. promoter or temiinator). but may comprise or extent to other 
sequences (e.g. being part of the coding region of the trait gene and the respective 

20 tenminator sequence of said trait gene. 

1II.3. Double-Strand Break Inducing Enyzme of the invention 

Preferably deletion / excision of the dual-function maricer is realized by homologous 
recombination between the above specified homology sequences induced by a se- 

2S quence-spedfic double^rand break, preferably between the homology fe^V^^s 
which should recombine- General methods are disclosed for example in WO 
03/004659, incorporated herein entirely by reference. Various enzyme suitable for in- 
duction of sequence-specific double-strand breaks (hereinafter together endonucle- 
ase') are known in the ait The endonuclease may be for example selected from the 

30 group comprising: 

1. Restriction endonucleases (type II). preferably homing endonudeases as de- 
scribed in detail hereinbelow. 
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35 2. 



Transposases, for example the P-element transposase (Kaufman PD and Rio DC 
(1 992) Cell 69(1);27-39) or AcDs (Xiao YL and Peterson T (2000) Mol Gen Genet 
263(1)-22-29) In principle, all transposases or integrases are suitable as long as 
they have sequence spedflcity (Haren L et al. (1999) Annu Rev Microbiol. 
1999;53:245-281; Beall EL. Rio DC (1997) Genes Dev. 11(16):21 37-21 51). 

3. Chimeric nudeases as described in detail hereinbelow. 

4 Enzymes which induce double-slrand breaks in the immune sftem. such as the 
RAG1/RAG2 system (Agrawal A et al. (1998) Nature 394(6695):744-4S1). 

5 Group II intron endonudeases. Modifications of the intron sequence allows group 
II Inlrons to be directed to virtually any sequence in a double-stranded DNA, 
where group II introns can subsequently insert by means of a reverse splice 
mechanism (Mohr et al. (2000) Genes & Development 14:569-573; Guo et al. 
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(2000) Science 289:452- 457). During this reverse splice mechanism, a double- 
strand hrcak is Introduced into the target DMA, the excised intron RNA deavfng 
the sense strand while the protein portion of the group II intron endonuclease hy- 
drolyses the antisense strand (Guo et al. (1997) EMBO J 16; 6835- 6B48). If it is 
5 only desired to induce the double-strand break without achieving complete re- 

verse splicing, as is the case in the present invention, it is possible to resort to, 
for example, group II intron endonudeases which lack the reverse transcriptase 
activity. While this does not prevent the generation of the double-strand break, 
the reverse splicing mechanism cannot proceed to completion. 

10 

Suitable enzymes are not only natural enzymes, but also synthetic enzymes- Preferred 
enzymes are all those endonudeases vuhose recognition sequence is knovm and which 
can either be obtained In the form of their proteins (for example by purification) or ex- 
pressed using their nucleic acid sequence. 

15 

In an preferred embodiment a sequence-specific endonudease is employed for specific 
induction of double-strand breaks and subsequent induced homologous recombination. 
The term "Sequence specific DNA-endonudease'* generally refers to all those enzymes 
which are capable of generating double-stnand breaks in double stranded DNA In a 

20 sequence-specific manner at one or more recognition sequences. Said DNA cleavage 
may result In blunt ends, or so called "sticky" ends of the DNA (having a 5'- or 3- 
overhang). The cleavage site may be localized within or outside the recognition se- 
quence. Various kinds of endonudeases can be employed. Endonudeases can be, for 
example, of the Class II or Class Us type. Class lis R-M restriction endonudeases cata- 

25 lyze the DNA cleavage at sequences other than the recognition sequence, I.e. they 
cleave at a DNA sequence at a particular number of nudeotides away from the recog- 
nition sequence (SzybalskI et aL (1991) Gene 100:13-26). The following may be men- 
tioned by way of example, but not by limitation: 

30 1. Restridion endonudeases (e.g., type II or Us), preferably homing endonudeases 

as described in detail hereinbelow. 

2. Chimeric or synthetic nudeases as described in detail hereinbelow. 

35 Unlike recombinases, restriction enzymes typically do not ligate DNA, but only cleave 
DNA. Restriction enzymes are described, for Instance, In the New England Biolabs 
online catalog (www.neb.com). Promega online catalog (yAVW.promega.com) and Rao 
et al. (2000) Prog Nudeic Acid Res Mol Biol 64:1-63. Within this invention "ligation* of 
the DNA ends resulting from the deavage by the endonudease Is realized by fusion by 

40 homologous recombination of the homology sequences. 

Preferably, the endonudease is chosen in a way that its conresponding recognition 
sequences are rarely, if ever, found in the unmodified genome of the target plant 
organism. Ideally, the only copy (or copies) of the recognition sequence In the genome 
45 is (or are) the one(s) introduced by the DNA construct of the invention, thereby 
eliminating the chance that other DNA In the genome is exdsed or rearranged when 
the sequence-specific endonudease is expressed. 

One criterion for selecting a suitable endonudease is the length of its con-esponding 
SO recognition sequence. Said recognition sequence has an appropriate length to allow for 
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' rare cleavage, more preferably cleavage only at the recognition sequence(s) comprised 
in the DMA construct of the invention. One factor determining the mininnum length of 
said recognition sequence is - from a statistical point of view - the size of the genome 
of the host organism. In an preferred embodiment the recognition sequence has a 
5 length of at least 10 base pairs, preferably at least 14 base pairs, more preferably at 
least 16 base parrs, especially preferably at least IS base pairs, most preferably at 
least 20 base pairs. 

A restriction enzyme that cleaves a 1 0 base pair recognition sequence is described in 
10 Huang B et aL (1996) J Protein Chem 15(5):4B1-g . 

Suitable enzymes are not only natural enzymes, but also synthetic enzymes. Preferred 
en^mes are all those sequence specific DNA-endonucleases whose recognition se- 
quence is known and which can either be obtained in the forni of their proteins (for ex- 
IS ample by purification) or expressed using their nucleic add sequence. 

Especially preferred are restriction endonucleases (restriction enzymes) which have no 
or only a few recognition sequences — besides the recognition sequences present in 
the transgenic recombination construct - in the chromosomal DNA sequence of a par- 

20 ticular eulcaryotic organism. This avoids further double-strand breaks at undesired loci 
in the genome. This is why homing endonucleases are very especially preferred (Re- 
view: (Belfort M and Roberts RJ (1997) Nucleic Adds Res 25: 3379-3388; Jasin M 
(1996) Trends Genet. 12:224-226; Internet: htfcDi/Zrebase.neb. com/rebase/ re- 
base.homlna.htmn. Ovwnq to their long recognition sequences, they have no, or only a 

25 few, further recognition sequences in the chromosomal DNA of eukaryotic organisms in 
most ca^s. 

The sequences encoding for such homing endonucleases can be Isolated for example 
from the diioroplast genome of Chlamydomonas (Turmel M et al. (1993) J Mol Biol 

30 232: 446-467). They are small (1 6 to 26 kD) and their open reading frames (ORF) have 
a "codon usage" which is suiteble directly for nudear expression in eukaryotes (Mon- 
nat RJ Jr et al. (1999) Biochem Biophys Res Com 255:86-93). Homing endonudeases 
which are very espedaily preferably isolated are the homing endonucleases l-Scel 
(WO96/14408), l-Scell (Sarguiei B et al. (1990) l^ludelc Adds Res 16:5659-5665), U 

35 Scelll (Sarguiei B et al. (1991) Mol Gen Genet. 255:340-341), l-Ceul (Marshall (1991) 
Gene 104:241-245), l-Crel (Wang J et al. (1997) Nudeic Adds Res 25: 3767-3776), I- 
Chul (Cote V et al.(1993) Gene 129:69-76), I-Tevi (Chu et al. (1990) Proc Nat) Acad 
Sd USA 87:3574-3578; Bell-Pedersen et al. (1990) Nucleic Adds Res1 8:3763-3770), I- 
Tevll (Bell-Pedersen et al. (1990) Nudeic Adds Re8l8:3763-3770), l-Tevlil (Eddy et al. 

40 (1991) Genes Dev. 5:1032-1041). Endo Seel (Kawasaici et al. (1991) J Biol Chem 
266:5342-5347), l-Cpal (Turmel M et al. (199Sa) Nudeic Adds Res 23:2519-2525) and 
— I-Cpall (Turmel M etal. (199Sb) Mol. Biol. Evol. 12, 533-545). 

Further homing endonudeases are detailed in the abovementioned Internet website. 

45 and examples which may be mentioned are homing endonudeases such as F-Scel, F- 
Sceil. F-Suvl, F-TevI, F-TevH, l-Amai, I-Anll. 1-Ceul, l-CeuAIIP, l-Chul, l-Cmoel, l-Cpal, 
l-Cpall, I-Crel, l-CrepsblP. l-CrepsbliP. l-CrepsblllP, l-CrepsblVP, l-Csml, l-Cvul. I- 
CvuAlP, l-Ddil, i-Ddill. I-Dirl, I-Dmol. I-Hmul, l-Hmull. l-HspNIP, i-Ual. I-Msol. I-Naal, I- 
Nanl. 1-NdlP, l-NgrIP, I-Niti, l-Njal. I-Nsp236lP, I-Paki, l-PbolP. l-PculP. l-PcuAl, I- 

50 PcuVl, l-PgriP, l-PoblP. I-Porl, l-PorllP, l-PpblP, l-Ppol, l-SPBetalP, l-Scal, l-Scel, I- 
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Scell, l-Scelll , l-ScelV, l-SceV. I-Sce\^, l-SceVII, l^exlP, l-SnelP, l-SpomCP, I- 
SpomlP, l-SpomllP, l-SqulP, I-Ssp6803l, l-SthPhiJP, l-SthPhlST3P, l-SihPhiS3bP. I- 
TdelP, I-Tevi, l-Tevll, l-Tevlll, I-UarAP. l-UarHGPA1P, l-UarHGPA13P, l-VinlP, l-ZbilP. 
Pl-MtuI, PI-MtuHIP, Pl-MtuHllP. Pl-Pful, Pl-Pfull, Pl-Pkol, Pl-Pkoll, Pl-Pspl, Ph 
5 Rma43812IP, pl-SPBetalP, Pl-Scel, Pl-Tful, Pl-Tfull. Pl-Thyl. PI-TIH. Pl-Tlill. H-Drel, 
l-Basl. I-Bmol, I-Pogi, l-Twol, Pl-Mgal. Pl-PabI, Pl-PablJ. 

Preferred in this context are the homing endonucleases whose gene sequences are 
already known, such as, for example, F-Scel, l-Ceul, l-Chul, l-Dmol, l-Cpal, I-Cpall, I- 

10 Crel. 1-Csml, F-TevI, F-Tevll, l-TevI, l-Tevll, l-Anil, l-Cvul, l-Ddif, f-Hmul, l-Hmull, l-Llal, 
I-Nani, l^sol, l-Nitl, l-Njal, l-Pakl, l-Porl, l-Ppol, l-Scal, I-Ssp6603l, Pl-Pkol. Pl-Pkoll, 
PI-Pspl. Pl-Tful, PI-TIII. Especially preferred are commercially available homing en- 
donucleases such as l-Ceul, l-Scel. I-Dmol. I-Ppol, PI-Pspl or PI-SceL Endonucleases 
with particularly long recognition sequences, and which therefore only rarely (if ever) 

15 cleave within a genome include: l-Ceul (26 bp recognition sequence), PI-Pspl (30 bp 
recognition sequence), Pi-Scel (39 bp recognition sequence), I-Scel (18 bp recognition 
sequence) and l-Ppol (IS bp recognition sequence). The enzymes can be isolated from 
their organisms of origin in the manner with which the skilled worker is familiar, and/or 
their coding nucleic acid sequence can be doned. The sequences of various enzymes 

20 are deposited in GenBank. Very especially preferred are the homing endonucleases I- 
Scel, i-Cpal, l-Cpall, l-Crel and l-Chul. Sequences encoding said nucleases are known 
In the art and - for example - specified in WO 03/004659 (e.g., as SEQ ID NO: 2, 4, 6, 
8. and 10 of WP 03/004659 hereby incorporated by reference). 



25 In an preferred embodiment, the sequences encoding said homing endonucleases can 
be modified by insertion of an intron sequence. This prevents expression of a functional 
enzyme in procaryotic host organisms and thereby facilitates doning and transforma- 
tions procedures (e.g., based on E.coli or Agrobacterium). In plant organisms, expres- 
sion of a functional enzyme is realized, since plants are able to recogntee and ''splice" 

30 out introns. Preferably, introns are inserted in the homing endonucleases mentioned as 
preferred above (e.g., into l-Scel or l-Crel). 

In some aspects of the invention, molecular evolution can be employed to create an 
improved endonuclease. Polynudeotides encoding a candidate endonudease ertzyme 

35 can, for example, be modulated with DNA shuffling protocols. DNA shuffling is a proc- 
ess of recursive recombination and mutation, performed by random fragmentation of a 
pool of related genes, followed by reassembly of the fragments by a polymerase chain 
reaction-like process. See, e.g., Stemmer (1994) Proc Nati Acad Sci USA 91:10747- 
10751; Stemmer (1994) Nature 370:369-391; and US 5.605,793, US 5,837,458, US 

40 5,830.721 and US 5, 81 1,238. 

Other synthetic endonudeases which may be mentioned by way of example are chi- 
meric nucleases which are composed of an unspedfic nuclease domain and a se- 
quence-specific DNA binding domain consisting of zinc fingers (Bibikova M et ah 

45 (2001) Mol Cell BioL 21:289-297). These DNA-binding zinc finger domaines can be 
adapted to suit any OIMA sequence. Suitable methods for preparing suitable zinc finger 
domaines are described and known to the skilled worker (Beerii RR et aL, Proc Nati 
Acad Sd U 8 A. 2000; 97 (4): 1495-1 500; Beerii RR. et aL, J Biol Chem 2000; 
27S(42):3261 7-32627; Segal DJ and Barfaas CF 3rd.. Curr Opin Chem Biol 2000; 

50 4(1):34-39; Kang JS and Kim JS, J Biol Chem 2000; 275(12):8742-8748; Beerii RR et 
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aL. Proc Natl Acad Sd USA 1998; 9S(2S):14628-14633; Kim JS etal,, Proc Natl Acad 
Sci USA 1997; 94(8): 361 6-3620; Klug A. J Mol Biol 1999; 293{2):21S-21B; Tsal SY et 
af., Adv Drug Deliv Rev 1998;30(1-3):2a-31; Mapp AK et aL, Proc Nati Acad Sd USA 
2000; 97(B):3930-3935; Sharrodcs AD etal., Int J Biochem Cell Biol 1997; 
5 29(12):1371-1387; Zhang L et aL. J Biol Chem 2000; 275(43);33850-33860). 

The endonudease is preferably expressed as a fusion protein with a nudear localiza- 
tion sequence (NLS). This NLS sequence enables fadlitated transport into the nudeus 
and increases ttie efficacy of the recombination system. A variety of NLS sequences 

10 are known to tiie skilled worker and desOTbed, inter alia, by Jicks GR and Raikhel NV 
(1995) Annu. Rev. Cell BioL 11:165-188. Preferred for plant organisms ls» for example, 
the NLS sequence of the SV40 large antigen. Examples are provided in WO 
03/060133. However, owing to the small size of many DSBI enzymes (such as. for ex- 
ample, the homing endonudeases), an NLS sequence is not necessarily required. 

15 These en2ymes are capable of passing through the nudear pores even without any 
aid. 

In a further preferred embodiment, the activity of the endonudease can be induced. 
Suitable methods have been described for sequence-specific recombtnases (Angrand 

20 PO et aL (1998) Nud. Adds Res. 26(13):3263-3269; Logie C and Stewart AF (1995) 
Proc Natl Acad Sd USA 92(1 3): 5940-5944; Imai T et aL (2001) Proc Natl Acad Sd 
USA 88(1):224-228). These methods employ fusion proteins of the endonudease and 
the ligand binding domain for steroid hormone receptor (for example the human andro- 
gen receptor, or mutated variants of the human estrogen receptor as described 

25 therein), induction may be effected vtdth Irgands such as. for example, estradiol, dexa- 
methasone, 4-hydroxytamoxifen or raioxifen. Some endonudeases are adive as 
dimers (homo- or heterodimers; l-Crel forms a homodlmen l-SecIV forms a het- 
erodimerk) (Wemetle CM (1998) Biochemical & Biophysical Research Communica- 
tions 248(1):1 27-333)). Dimerization can be designed as an inducible feature, for ex- 

30 ample by exchanging the natural dimerization domains for the binding domain of a low- 
molecular-weight ligand. Addition of a dimeric ligand then brings about dimerization of 
the fusion protein. Corresponding indudble dimerization methods, and the preparation 
of the dimeric ligands, have been described (Amara JF et aL (1997) Proc Natl Acad Sci 
USA 94(20): 10618-1623; Muthuswamy SK et al. (1999) Mol Cell Biol 19(1D):6845-685; 

35 Schultz LW and Clardy J (1998) Bioorg Med Chem Lett B(1):1-6; Keenan T et aL 
(1998) Biooing Med Chem. 6(8): 1309-1 335). 

Recognition sequences for sequence specific DNA endonudease (e.g., homing en- 
donudeases) are described in the art. "Recognition sequence" refers to a DNA se* 
40 quence that Is recognized by a sequence-specific DNA endonudease of the invention. 
The recognition sequence will typically be at least 10 base pairs long, is more usually 
10 to 30 base pairs long, and in most embodiments, is less than SO base pairs long. 

''Recognition sequence" generally refers to those sequences which, under the condi- 
45 tions in a plant cell used within this invention, enable the recognition and cleavage by 
the sequence specific DNA-endonuclease. The recognition sequences for the respec- 
tive sequence specific DNA-endonudeases are mentioned in Table 5 hereinbelow by 
way of example, but not by limitation. 
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Table 5: Recacfnition sequences and organisms of origin for endonucleases (e.g.| homing en- 
donucleases; indicates the cleavage site of the sequence specific DNA-endonudease within 
a recognitjon sequence). 



uSdI 

Enzyme 


Organl&m 
of origin 


Recognition sequence 


p-Eiemeni 
Trans- 

h053Sb 


Drosophila 


5 -CTAGATGAAATAACATAAGGTGG 


l-Anil 


Aspergillus nidu* 

Ions 


5'-TTGAGGAGGTT^TCTCTGTAAATAANNNNNNNNNNNNNNN 


l-Ddil 


Dictyostelium dis- 

COiaeU mr^A^ 


b'-l i 1 I I I GG1C ATCCAGAAGiAlAi 
»4 -AAAAAALiGMta^ 1 ACaia IC IT CA T A I A 


l-Cvul 


Chlorella vulgaris 


S'-CTGQGTTCAAAACGTCGTGA'^GACAGTrTGG 

It* rtlA^rf^'^A A/iT"!"!" I'/^i^A/SA^AI^Tr* 1 ^A A 

«a-wAwCWAA6l 1 1 1 u1.*AC*lJAC 1 C i GTCAAACC 


l-Csml 


Chlamydomonas 

smitnir 


S"-GTACTAGCATGGGGTCAAATGTCTTTCTGG 


1-Cmoel 


Chlamydomonas- 
moewusii 


S'-TCGTAGCAGCT*CACGGTr 

3 -AGCATwG'*TCGAGTGCCAA 


l-Crel 


Chlamydomonas 
reinnarapi 


S'-CTGGGTTCAAAACGTCGTGA^GACAGTTTGG 
3 -CaACCCAAu 1 1 1 1 uuAG^CAw T C 1 61 CAAACC 


l-Chul 


Chlamydomonas 

numicoia 


S'-GAAGGTTTGGCACCTCG'^ATGTCGGCTCATC 

3 -CTTuCAAACGG I G GAGCTACAGCCC5AG T AG 


i-Cpal 


Chlamydomonas 
palildostigmattcd 


S'-CGATCCTAAGGTAGCGAA^ATTCA 
S'-GCTAGGATTCCATC^GCTTTAAGT 


l-Cpali 


Chlamydomonas 
paiiiaosug maxica 


ff-CCCGGCTAACTC^TGTGCCAG 
3 -GGGwCGA 1 ^ 1 CSAGACACGGTC 


l-Ceul 


Chlamydomonas 
eugametos 


5^-CGTAACTATAACGGTCCTAA''GGTAGCGAA 
3 -GCATTGATATTGCCAG^GATTCCATCGCTT 


l-Dmol 


Desutfuro- 

COCCUS mODIlIS 


5'-ATGCCTTGCCGGGTAA*GTTCCGGCQCGCAT 

3 -TACGGAACGGCC*CATTCAAGGCCGCGCGTA 


l-Scel 


Saocharomyces 

CefeVlSiae 


S'-AGTTACGCTAGGGATAA''CAGGGTAATATAG 

■Al 'f'^^jt A 1 AT'<^^<^A*yA'FI"#"^T'i A^ ' ff' ATa'!'^ 

3 -TCAA 1 QCQATCCC'^TATTGT CCCATTATATC 
S'-TAGGGATAA'^CAGGGTAAT 


I-Sceli 


S.cerevisiae 


fa'- 11 TTGATTCI 1 1 GG 1 CACCC^ 1 GAAGTATA 

V> -MAAMw 1 MAwMAAwwAw ■ UUUAu* 1 1 l^M 1 A 1 






4g*_ ATTfSC^ AfSft i i l ie; f^TA AI**aTATTTATTA C* 

3'-TAACCTCCAAAACC^ATrGATAAATAATGG 


l-ScelV 


S.cerevisiae 


S'-I CnTTCTCI IGAHA'^GCCC iAATCTACG 
3 -AGAAAAG AGAAC^TAATC GGGATTAGATGC 


l-SceV 


SIcerevlsiae 


S'-AATAATTTTCT^TCTTAGTAATQCC 
3'-rrArrAAAAGAAGAATCATTA'K:6G 


l-SceVi 


S.cerevisiae 


S'-GTTA'i 1 lAA IG^'TTl lAGlAGI IGG 
3'-CAATAAATTACAAAATCATCA'^ACC 


l-SceVll 


S.cerevisiae 


S'-TGTCACATTGAGGTGCACTAGTTATTAC 


Pl-Scel 


S.cerevisiae 


S'^TCTATGTCGGGTGC^GAGAAAGAGGTAAT 
a'-TAGATACAGCC^CACGCCTCTTTCTCCATTA 
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DSBl 

Enzyme 



F-Sce! 



F-Scell 



l-Hmul 



l-Hmull 



hNani 



l-^fti 



l-NJal 



i-Paki 



l-Port 



l-Ppol 



l-Scel 



Organism 
of origin _ 



S.cerevisiae 



S.cerevislae 



Bacillus subtilis 
bacteriophage 

SP01 

Bacillus subtilis 
bacteriophage 

SP62 



Lactococciis lactis 



Recognition sequence 



5'-GATGCTGTAGGC*ATAGGCTTGGTT 
3'-CTACGACA^TCCGTATCCGAACCAA 
F-CTTTCCGCAACA'^GTAAAATr 
a^GAAAGGCG^TTGTCATTTTAA 



S'-AGTAATGAGCCTAACGCTCAGCAA 
S'-TCATTACTCGGATTGCHSAGTCGTT 



S'-AGTAATGAGCCTAACGCTCAACAANNNNNNNNNNNNNNNN- 
NNNNNNNNNNNNNNNNMNNNNNN 



Monomastix 

ides 



Naegleria ander- 
soni 



Naegleria italica 



Naegleria 
I iamiesoni 

r seiiaefiauuitjiiiUi ■ ■ 

akinefum 



S'-CACATGCATAAC^ATATCATmr 
3'-GTGTAGGTATTGGTATAGTAA^AAA 



S'-CTGGGTTCAAAACGTCGTGA^ACAGTTTGG 
3'-GACCCAAGTTTTGCAG^ACTCTGTCAAACC 



S"-AAGTCTGeTGCCA*GCACCCGC 
3'-TTCAGACC^ACGGTCGTGGGCG 



6'-AAGTCTGGTGCCA*GCACCCGC 
3'^•TTCAGACC^^CGGTCGTGGGCG 



S'-AAGTCTGGTGCCA^CACCCGC 
3'«TTCAGACC^ACGGTCGTGGGCG 



I-Ssp6803i 



Pyiobaculum or- 
qanotrophum 



Physarum 
cephalum 



poly- 



«ri /^T/irartTT-r^AAAArtftTrQTrtA'^GACAGTrTGG 
a'-GACCCAAGTTTTGCAG'^CACTCTGTCAAACC 

^ .GCGAGCCCGTAAQGGT^GTGTACGGG 
3'-CGCTCGGGCATT^CCACACATGCCC 



S'-TAACTATGACTCTCTTAA'^QGTAGCCAAAT 
3'^ATTGATACTGAGAG^AATTCCATCGGTTTA 



Saccharomyces 

capensis 



S'-TGTCACATTGAGGTGCACT'^AGTTATTAC 
3'-ACAGTGTAACTCCAC^GTGATCAATAATG 



Synechocystis 
species 



ff-GTCGG GCT'^ATAACCCGAA 
3'-^GCCCGAGTA^TTGGGCTT 



Pl-Pful 



Pyrxjcoccus furi- ff-GAAGArGGGAGGAGGG^ACCGGACTCAACTT 
OSUS VCI I y- crrrCTACCCTCC^TCCCTGGCCTGAGTTGAA 



Pl-Pfull 



Pyrococcus furv s'-acqaatccatgtggaga^agagcctctata 
OSUS Vci 1 3 -tgcttaggtacac^ctcttctcggagatat 



Pl-Pkol 



Pyrococcus koda- S'-GATTTTAQAT'^CCCTGTACC 
karaensis K0D1 1 3'-ctaaAA^TCTAGGGACATGG 



Pl-Pkoll 



Pyrococcus koda- j S'-CAGTACTACG^GTTAC 
karaensis KOD1 I s'-gtcaTG^atg ccaatg 



Pl-Pspl t Pyrococcus sp. 



Pl-Tful 



S'-AAAATCCTGeCAAACAGCTATTAT^GGGTAT 
aMTTTAGGACCGTTTGTCGAT WiTACCCATA 
Thermococcus | ff-TAGATTTTAGGT^CGCTATATCCTTCC 

fiimicolans STSS7 | 3'-atctaaAA^tccagcgatataggaagg 



PI-TfuU 



Thermococcus I S'-TAYGCNGAYACKPGACGGYTTYT 

I himicolans STSS7 3'-atrcgnct-rtgnctgccraara 
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DSBl 
Enzyme 


Organism 
of origin 


Recognition sequence 




hydrothermalis 


S'-ATRCGNCTMiTGNCTGCCRAARA 


■ • I III 


III Wl t • lUWUIiAi^lJCV 

litoraHs 


a'-ATRCGNCTRTGNCTGCCRAARA 


Pl-Tlill 


Thenmococcus 

ntoralis 


S'-AAATTGCTTGCAAACAGCTATTACGGCTaT 


l-Tewl 


Bacteriophage T4 


S'-AGTG6TATCAAC*GCTCAQTAGATG 
3'TCACCATAGT*TGCGAGTCATCTAC 


i-Tevll 


Bacteriophage T4 


S'-GCTTATGAGTATGAAGTGAACACGT^TATTC 
3'-CGAATACTCATACTTCACTTGTG*CAATAAG 


F-TevI 

t 


Bacteriophage T4 


5'-GAAACACAAGA''AATGTTTAGTAAANNNNNNNNNNNNNKl 

3'-CTTTGTGTTCTTTACAAATCAl"TrNNfaNNfsJNNNNNNNN'^ 


F-Tevll 


Bacteriophage T4 


S'.TTTAATCCTCGCTTC^AGATATGGCAACTQ 

S'-AAATTAGGAGCGA^AGTCTATACCGTTGAC 


H-Drel 




S'-CAAAACGTC GTAA'^GTTCCGGCGCG 
3'-GTTTTGCAG*CATTCAAGQCCGCGC 




ingiensis phage 


3'. TCATTACGAGTCGAACTCGGATTG 


l-Bmol 


Bacillus mojavan" 
s/5s67-18 


ff-GAGTAAGAGCCCG^TAGTAATGACATGGC 
a'-CTCATTCTCG^GGCATCATTACTGTACCG 




*Jf* WG3 WUf Iff ff f 


fi-CTTCAGTAT*GCCCCGAAAC 

3'.gaagt'm::atacggggctttg 


l-Twril 

I"" 1 wVUI 


Sts ohs/lt^f^G^iis. 

autdus phage 


S-TCTTGCACCTACACAATCCA 
3'-AGAACGTGGATGTGTTAGGT 


Pl-Mgal 


Mycobacterium 
gastri 


S'-CQTAGCTGCCCAGTATGAQTCA 
3'-GCATCGACGGGTCATACTCAGT 


Pl-PabI 


Pyracoccus abyssi 


S'-GGGGGCAGCCAGTGGTCCCGTT 
3'-CCCCCGTCGGTCACCAGGGCAA 


Pl-Pabll 


Pytococcus abyssi 


5*-ACCCCTGTCGAGAGQAGCCCCTC 
3'-TGGGGACACCTCTCCTCGGGGAG 



Also encompassed are minor deviations (degenerations) of the recognition sequence 
which still enable recognition and cleavage by the sequence specific DMA- 
endonuclease in question. Such deviations - also in connection with different frame- 
5 work conditions such as, for example, calcium or magnesium concentration — have 
been described (Argast (3M et al. (1998) J Mol Biol 280: 345-353). Also encompassed 
are core sequences of these recognition sequences and minor deviations (degenera- 
tions) in there. It is known that the inner portions of the recognition sequences sufRce 
for an induced double-strand break and that the outer ones are not absolutely relevant, 
10 but' can codetermine the cleavage efficacy. Thus, for example, an 18bp core sequence 
can be defined for l-Scel. 
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lil,4. Combination with other recombination enhancing techniques 

In a further preferred embodiment, the efficacy of the recombination system is in- 
creased by combination with systems which promote homologous recombination. Sudi 
systems are described and encompass, for example, the expression of proteins such 
6 as RecA or the treatment with PARP Inhibitors, It has been demor^trated that the In- 
trachromosomal homologous recombination in tobacco plants can be increased by 
using PARP inhibitors (Puchta H et al. (1985) Plant J. 7:203-210). Using these inhibi- 
tors, the homologous recombination rate in the ONA constructs of the invention after 
induction of the sequence-specific ONA double-strand break, and thus the efficacy of 

10 the deletion of the transgene sequences, can be increased further. Various PARP in- 
hibitors may be employed for this purpose. Preferably encompassed are inhibitors such 
as 3-aminobenzamide, 8-hydroxy-2-methylquinazolln-4-one (NU1025), 1 , 11 b-dlhydro- 
[2H3benzopyrano[4,3,2-de]isoquinolln-3-one (GPI 6150), S-aminoisoquino-linone, 3,4- 
dihydro-5-[4-(1-piperidinyl)butoxy]-1(2iH)HsoquInolinone, or the compounds descrii^ed 

15 in WO 00/26192, WO 00/29384. WO 00/32579, WO 00/64878. WO 00/68206, WO 
00/67734, WO 01/23386 and WO 01/23390. 

In addition, it was possible to increase the frequency of various homologous recombi- 
nation reactions in plants by expressing the E. coli RecA gene (Reiss B et al. (1 996) 

20 Proc Natl Acad Sci USA 93(7): 3094-3098). Also, the presence of the protein shifts the 
ratio between homologous and illegitimate DSB repair in favor of homologous repair 
(Reiss B et al. (2000) Proc Natl Acad Sci USA 97(7):335a-3363). Reference may also 
be made to the methods described in WO 97/08331 for increasing the homologous 
recombination in plants. A further increase in the efficacy of the recombination system 

25 might he achieved by the simultaneous expression of the RecA gene or other genes 
which increase the homologous recombination efficacy (Shalev G et al. (1999) Proc 
Natl Acad Sci USA 96(13):7398-402). The above-stated systems for promoting ho- 
mologous recombination can also be advantageously employed in cases where the 
DIMA construct of the invention Is to be introduced in a site-directed fashion into the 

30 genome of a eukaryotic organism by means of homologous recombination. 

Hi.S initiation of Deletion / Excision 

There are various means to appropriately initiate deletion / excision of the dual-function 
maricer. Preferably deletion is only initiated after the dual-function marker has success- 
35 fully completed its function has negative selection marker resulting in insertion of the 
DNA construct of the invention into the genome of the cell or organism to be trans- 
formed. 

Various means are available for the person skilled in art to combine the dele- 
40 tion/exdsion inducing mechanism with the DNA construct of the invention comprising 
the D-amino oxidase dual-function Iselection maricer Preferably, a recombinase or en- 
donuclease (hereinafter together also "excision enzyme") employable in the method of 
the invention can be expressed or combined with its corresponding recombination or 
recognition site, respectively, by a method selected from the group consisting of: 

45 

a) incorporation of a second expression cassette for expression of the excision enzyme 
(the recombinase or sequence-specific endonuclease) operably linked to a plant 
promoter into said DNA construct, preferably together with said first expression cas- 
sette flanlced by said sequences which allow for specific deletion, 
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b) incorporation of a second expression cassette for expression of the excision enzyme 
(the recombinase or sequence-specific endonuclease) operably linked to a plant 
promoter into the plant cells or plants used as target material for the transformation 
thereby generating master cell lines or cells. 

c) incorporation of a second expression cassette for expression of the excision enzyme 
(the recombinase or sequence-specific endonuclease) operably linked to a plant 
promoter Into a separate DMA construct, which is transfomned by way of co- 
transformation with said first DNA construct Into said plant cells or transformed into 
cells already comprising said first DNA construct. 



Accordingly the first DNA construct of the Invention and the excision enzyme (e.g., the 
recombinase or endonuclease) can be combined in a plant organism, cell, cell com- 
partment or tissue for example as follows: 



1 . ) Plants which have the first DNA constmct inserted into their genome (preferably 

Into the chromosomal DNA) are generated in the customary manner utilizing the 
dual-function marker as negative selection marker (for example, by Agrobacteria- 
mediated transformation). A second expression cassette for the excision enzyme 
20 is then combined with said DNA constructs by 

a) a second transformation with said second expression cassette, or 

b) crossing of the plants comprising the first DNA construct with master plants 
25 comprising the expression cassette for the excision enzyme. 

2. ) The expression cassette encoding for the excision enzyme can be Integrated Into 

the DNA construct which already bears the expression cassette for the dual- 
function martcer. It Is preferred to insert the sequence encoding the excision en- 

30 zyme between the sequences allowing for deletion and thus to delete It from the 

genomic DNA after it has fulfilled its function. Very especially preferably, expres- 
sion of the endonuclease is Inducible In such a case (for example under the con- 
trol of one of the inducible promoters described hereinbelow). in a development- 
dependent fashion using a development-dependent promoter, or else excision 

35 enzymes are employed whose activity is inducible in order to avoid premature de- 

letion of the dual-function marker prior to its insertion into the genome. 

4. ) Relying on the co-transformation technique, the expression cassette which en- 

sures the expression of the excision enzyme can be transformed into the cells 
40 simultaneously with the first DNA construct, but on a separate vector. Co- 

transformation can be in each case stable or transient In such a case, expres- 
^ sion of the excision enzyme is preferably inducible (for example under the control 
of one of the inducible promoters described herelnbelow), in a development- 
dependent fashion using a development-dependent promoter, or else excision 
45 enzymes are employed whose activity Is inducible in order to avoid premature de- 

letion of the dual-function marker prior to its insertion into the genome. 

5. ) Plants expressing the excision enzyme may also act as parent individuals. In the 

progeny from the crossing between plants expressing the excision enzyme on the 
SO one hand and plants bearing the first DNA constmct on the other hand, the de- 
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sired marker deletion (e.g., by double-strand breaks and recombination between 
the homology sequences) are observed. 

6. ) Expression of the excision enzyme is also conceivable in a transient transforma- 
5 tion approach in v\^tch the possibilities 2 to 4 can be exploited, 

7. ) The exdsion enzyme can also be introduced into cells comprising or bearing the 

transgenic recombination construct directly, for example via microinjection, parti- 
cle bombardment (biolistic method), polyethylene glycol transfection or liposome- 
10 mediated transfection. This embodiment Is advantageous since no excision en- 

zyme-encoding sequences remains In the genome. Such a method has been de- 
scribed for example by Segal DJ et al. (1995) Proc Natl Acad Sd USA 92:806- 
610. 

15 B.) The excision enzyme may also be generated by introducing the exdsion enzyme 

encoding, in-vrtro-generated mRNA into cells (for example via microinjection, 
partide bombardment (biolistic method) or liposome-mediated transfection). This 
embodiment is advantageous since no exdsion-enzyme-encoding sequences will 
remain in the genome. 

20 

9.) The excision enzyme can be introduced into plant cells as a fusion protein with 
the VirE2 or VirF protein of an Agrobacterium. Such methods have been de- 
scribed for example for Cre recombinase (Vergunst AC et ai. (2000) Science. 
290: 979-982). If the expression cassette for the fusion protein is located outside 
25 the border sequences, it is not inserted 

As described above, the exdsion enzyme ran be generated using an expression cas- 
sette which comprises the DNA encoding an excision enzyme and is introduced into a 
plant cell or organism. In tills context, the expression cassette for the excision enzyme 
30 preferably comprises a nucleic add sequence encoding an exdsion enzyme. Various 
suitable cassettes are described in WO 03/004659. 

A preferred embodiment of the invention is related to DNA constructs comprising both 
the expression cassette for the dual-fundion marker (the first expression cassette) and 
35 a second expression cassette for the excision enzyme (e.g., an endonudease or re- 
combinase encoding sequence linked to a plant promoter), preferably In a way that 
said second expression cassette is together with said first expression cassette flanked 
by said sequences which allow for specific deletion* 

40 In another pmferred embodiment the mechanism of deletion/excision can be induced 
or activated in a way to prevent pre-mature deletion/exdsion of the dual-function 
marker. Preferably, thus expression and/or adivity of an preferably employed excision 
enzyme can be induced, preferably by a method selected from the group consisting of 

45 a) inducible expression by operably linking the sequence encoding said excision en- 
zyme (e.g., a recombinase or endonudease) to an inducible promoter, 

b) Inducible activation, by employing a modified excision enzyme (e.g., a recombi- 
nase or endonudease) comprising a ligand-binding-domain, wherein activity of 
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said modified excision enzyme can by modified by treatment of a compound hav- 
ing binding activity to said ligand-binding-domain. 

Expression of the polynucleotide encoding the exdsion enzyme is preferably controlled 
5 by an excision promoter, which allows for expression in a timely manner so that the 
dual-function marker can perform its function as a negative selection marker before 
getting excised. Suitable promoters ara for example described in the German Patent 
Application DE 03028884.9. Such promoters may have for example expression speci- 
ficity for late developmental stages like e.g., reproductive tissue. The excision promoter 
10 may be selected from one of the following groups of promoters: 

a) Pollen-specific promoters such as, for example, the promoter of the B. campestris 
bgp1 gene (GenBank Acc.-No: X68210; Xu H et al. (1993) Ma\ Gen Genet 239(1- 
2):S8-6S; WO 94/13809), of the Oryza satlva ory s 1 gene (GenBank Acc.-No.: 

15 AJ012760; Xu H et al, (199S) Gene 164 (2):2S5-2S9), of the pollen-specific maize 
gene ZM13 (Hamilton DA et al. (1998) Plant Mol Biol 38(4):663-669; US 5,086,169), 
and of the B.napus gene BpIO (GenBank Acc.-No.: X64257; Albani D (1992) Plant 
J 2(3):331-342; US 6.013,859). The promoter of the potato invGF gene from potato 
(Plant Mol. BioL, 1999, 41:741-751; EM8L Acc No. AJ13376S; especially prefen^ed 

20 is the promoter described In German Patent Application DE 03028884.9 by SEQ ID 
NO: 1). The Lcgl promoter (WO 99/05281; XU H et al. (1999) Proc. Natl. Acad- Sci. 
USA Vol 96:2554-2558). 

b) Promoters active in ovules (ue. egg cells) 

25 The promoter of the Arabidopsis AtSERKI gene (Somatic Embryogenesis Receptor- 
Like Kinase 1; At1G71830: Hedht et aL (2001) Plant Physiol 127:803416). Espe- 
cially preferred Is the promoter deacribed in German Patent Application DE 
□3026684.9 by SEQ ID NO: 2. 

30 c) Promoters active in zygotes 

The promoter of the Arabidopsis gene AtcycIA (Cyciin cyc1 gene, type cydin B; 

At4g37490; Plant Cell 6: 1763- 1774 (1994)). Especially preferred is the promoter 

described in German Patent Application DE 0302B884.9 by SEQ ID NO: 5. 

USP promoter from Vida f^ba (Baumlein H et ai. (1991) Mol Gen Genet 225:459> 
35 467; Fiedler U et al. (1993) Plant Mol Biol 22:669-679). Especially preferred is the 

promoter described in German Patent Application DE 03028884.9 by SEQ ID NO: 3. 

The USP promoter has further activity also in early immature embryos, 

d) Promoters active in meristems ~ 
40 The promoter of the gene erecta (Acc. No. D832S7) from Arabidopsis is active in 
menstematic cells (Yokoyama et a!., 1998, Plant J. 15: 301-310). Especially pre- 
ferred is the promoter described in German Patent Application DE 03028a&4.9 by 
SEQ ID NO: 4. 

45 Altematively, an inducible promoter (which may have ubiquitous expression activity 
when induced) can be employed with the application of the corresponding inducer at 
the appropriate time point. Preferably, the Inducer for the inducible promoter is applied 
together with (or briefly before) application of the compound M (e.g., D-isoleucine or D« 
valine) which is converted by action of the dual-function DAAO marker into a phytotoxic 

50 compound. 
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The term 'Inducible" as applied to a promoter Is well understood by those skilled in the 
art. In essence, expression under the control of an inducible promoter is "switched on" 
or increased in response to an applied stimulus (which may be generated within a cell 
or provided exogenously). The nature of the stimulus varies between promoters. What- 
5 ever the level of expression is in the absence of the stimulus, expression from any in- 
ducible promoter is increased in the presence of the correct stimulus. The preferable 
situation is where the level of expression increases upon in the presence of the rele- 
vant stimulus by an amount effective to alter a phenotypic characteristic 0-e. to express 
a DAAO and modify tolerance of a D-amino acid). Thus an inducible (or "switchahle'') 

10 promoter may be used which causes a basic level of expression in the absence of the 
stimulus which level is too low to bring about the desired D-amino acid tolerant or sen- 
sitive phenotype (and may in fact be zero). Upon application of the stimulus, expres- 
sion is increased (or switched on) to a level that causes enhanced D-amino add toler- 
ance or sensitivity. Many examples of inducible promoters will be Icnown to those 

15 sicilled in the art. 

The inducer can be a physical stimulus like light, heat, droMght (low moisture), wound- 
ing etc. However, preferably, the inducer Is an externally applied chemical substance. It 
is preferred that the inducible excision promoter only causes functional expression of 
20 the endonuclease operably linked if this chemical inducer is externally applied. This 
leads to a controlled, governable expression and deletion. 

Inducible and repressible promoters have been developed for use in plants (Rewiew: 
Gatz. Annu Rev Plant Physiol Plant IWIol Biol 1997, 4S:B9-10B), based on - for example 

25 - bacterial repressor (Gatz C & Quail PH (198B) Proc. Natl Acad. Sci. USA 85:1394- 
1397). animal steroid (Aoyama T & Chua IslH (1997) Plant J. 11:606-612; Martinez A et 
aL (1999) Plant J. 19:97-106) or fungal regulatory elements (Caddick MX et al. (1998) 
Nature Biotechnol 16:177-180). Promoter systems that are positively regulated by 
chemical ligands (inducible systems) Include the tetracycline(doxycycline)-induced 

30 Triple-Op' promoter (Gatz C & Quail PH (19BB) Proc Natl Acad Sd USA 85:1394-1397; 
Gatz C et aL (1991) Mol Gen Genet 277:229-237; Gatz C et al. (1992) Plant J. 2:397- 
404), the glucocorticoid-inducible 'GAL4-UA& promoter (Aoyama T & Chua NH (1997) 
Plant J. 11:606-612), the ecdysone-inducible •GRHEcR promoter (Martinez A et al. 
(1999) Plant J, 19:97-106) and the ethanol-inducible 'alcA promoter (Caddick MX et al. 

3S (1998) Nature Biotechnol 16:177-180). Hormones that have been used to regulate 
gene expression include, for example, estrogen, tomoxifen, toremifen and ecdysone 
(Ramkumar and Adier (1995) Endocrinology 136: S3S-S42). See, also, Gossen and 
Bujard Proc. Watl. Acad. Sd. USA 69: SS47 (1992); Gossen et al. Science 268: 1766 
(1995). In tetracycline-Inducible systems, tetracycline or doxycycllne modulates the 

40 binding of a repressor to the promoter, thereby modulating expression from the pro- 
moter. 

Inducible expression system can be distinguished into positively and negatively 
regulated systems. For positively regulated system, expression is induced by adding 

45 the corresponding inducer, for negatively regulated systems expression is induced by 
removing the Inducer (better named repressor in this case). An example for a 
negatively regulated (repressible) system is the tetracycline-inaclivated TopIO' 
promoter and derivatives (Bohner S et al. (1999) Plant J. 19. 87-95; Weinmann P et al. 
(1994) Plant J 5:559-669). The TopIO promoter sequence contains a tandem repeat of 

50 seven copies of the TnIO tet operator (tet-OP) DMA sequence that tightly bind the 
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tetracycline repressor polypeptide TetR (LedererT et al. (1995) Anal Biochem 232:190- 
196). This element is fused to a truncated vei^ion of e-g., the CaMV 35s promoter 
(nucleotide positions -53 to 0). The Top1Q promoter sequence Is recognized by a 
transactrvator that effectively acts as an artifidal transcription factor. The transactivator 
5 Is a chimeric protein fusion between amino acids 1-207 of TetR (Postle K et at. (1984) 
Nud Adds Res 12:4B49- 4963) and amino acids 363^90 of the transcriptional 
activation domain (VP1B) from the Herpes simplex virus (Triezenberg SJ et al. (1988) 
Genes Dev. 2:718-729), and is labelled TetRA/P16* or 'tTA (tetracycline transactivator). 
In the absence of tetracycline, the TetR portion of the tTA binds the tet-OP DMA 

10 sequences within the TopIO promoter with high affinity (Hinrichs W et al. (1994) 
Science 264:418-^20; Lederer T et aL (199S) Anal Biochem 232:190-196; Lederer T et 
al. (1996) Biochemistry 35:7439-7446). This interaction positions the VP16 domain of 
the tTA in close proximity to the TopIO promoter TATA box. enabling transgene 
transcription. However, in the presence of tetracycline, the TetR undergoes a 

IS confomiatlonal change (Hinrichs W et al. (1994) Science 264:418-420; Orth P et aL 
(199B) J Mol Biol 279: 439-447) that lowers its affinity for the TopIO promoter to non- 
specific binding levels (Lederer T et aL (1996) Biochemistry 35:7439-7446). 
Consequently! tTA binding to the TopIO promoter Is inhibited, and transcription is 
switched off. Use of the TopIO promoter system is particufarty advantageous in plants. 

20 First the TopIO promoter is not functional in the absence of the tTA, Second, transcrip- 
tional control is stringent, and tightly controlled by tetracycline. Third, tetracycline has 
no naturally occurring analogue in plant cells, which might othenwise interfere with 
promoter regulation. Fourth, the levels of tetracycline used to repress the TopIO 
promoter are extremely low, normally of the order of 1 |jg/ml, and have no discemible 

25 secondary effect on plants (Weinmann P et aL (1994) Plant J 5:SS9«S69). Rnally. 
coupling the two transformations required for promoter function can be achieved by 
transforming the same plants first with the 3SS:^TA piasmid construct and then with the 
TopIO promoter driving the gene of interest, or by mating transgenics which have 
independently been transformed with the appropriate constructs. The TopIO promoter 

30 has been successfully used in Nicotiana sp. (Weinmann P et aL (1994) Plant J 5:559- 
569) and in the moss PhyscomitrellB patens (Zeidler M et al. (1996) Plant Mol Biol 
30:199'20S)- 

Aitematively, a positively regulated tetracyclin based inducible expression system can 
35 be employed. Especially preferred is the inducible reverse tetracycline system, which 
allows expression to be up-regulated only upon addition of tetracyclin or a lipid-soluble 
derivative of tetracycline, doxycyclin (dox, Gossen M. et aL (1995) Science 268:1766- 
1769; Jiang DM etaL (2001) J. Neurochem. 76(6); 1 745-1 7SS), 

40 Inducible promoters that are directly responsive to physiologically active stimuli such as 
heat-shock (Prandl R et aL (1995) Plant Viol BioL 28:73-82; 1995; Seyerin K & 
SchoeffI F (1990) Plant WoL BioL 15:827-834). stress signalling molecules (Suehara Kl 
et aL (1996) J. Femi. Bioeng. 82, 51-55) or heavy metals (McKenzie, MJ., et aL (1998) 
Plant PhysioL 116,969-977) may also be employed. However, diemlcally indudble 

45 promoter systems are preferred. 

Indudbel expression systems have been used in several plant species, including 
tobacco (Gatz C et aL (1991) MoL Gen. Genet 277:229-237), potato (Kumar A et aL 
(1996) Plant J. 9:147-158), tomato (Thompson AJ & Wlyatt SC (1997) Plant MoL BioL 
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34:687-692) and Arabidopsls tiialiana (Aoyama T & Chua NH (1997) Plant J. 11:605- 
612). 

An additional example indudee tlie ecdysone responsive element (No et ai., Prac. Natl. 

5 Acad. Sd. USA 93: 3346 (1 997)). Other examples of inducible promoters include the 
glutathione-S^nsferase 11 promoter which is specifically induced upon treatment with 
chemical safenera such as N, N-dlallyl-2,2-dichloroacetamide (PCT Application Nos. 
WO 90/06626 and WO 93/01294) and the alcA promoter from Aspergillus, which in the 
presence of the alcR gene product is induced with cydohexanone (Loddngton, et al., 

10 Gene 33: 137-149 (1985); Felenbok, et al. Gene 73: 365-396 (1988); Gwynne, et al. 
Gene Si: 205-216 (1987)) as well as ethanol. Chemical inducers of promoters can be 
combined vwth other active chemicals or inert carriers prior to application to an orgarv- 
ism. For example, other agronomically useful chemical compositions such as pesti- 
ddes or fertilizers as well as carriers and solvents can be combined with the inducer, 

IS 

Further examples for indudble promoters indude the PRP1 promoter (Ward et al., 
Plant Mol. Biol. 22 (1993). 361-366). a salicylic-add-indudble promoter (WO 
95/19443). a benzenesulfonamide-indudble promoter (EP-A-03681B6), a tetracydin- 
indudble promoter (Gatz et al.. (1992) Plant J. 2. 397-404). an absdsic add-indudfale 
20 promoter (EP-A 33SS28). a salicylic acidHndudbie promoter (WO 95/1 9443) or an 
ethanol- (Salter MG et al. (1998) Plant J. 16:127-132) or cydohexanone-indudble (WO 
93/21334) promoter may likewise be used. 

Other prererrea promoiers are promoiers inuuwcu uy "iwuw wi auiwuw — , 

25 for example, the pathogen-indudble promoter of the PRP1 gene (Ward et aL, Plant Mol 
Biol 1993, 22: 361-366), the tomato heat-inducible hspBO promoter (US 5,187,267), the 
potato ehill-inducible alpha-amylase promoter (WO 96/12814) or the wound-induced 
pinll promoter (EP37S091). 

30 In an espedally preferred embodiment, the exdslon enzyme is expressed under the 
control of an indudble promoter. This leads to a controlled, governable expression and 
deletion - for example in plants -. and any problems caused by a constitutive expres- 
sion of an excision enzyme are avoided. 

35 Obviously, also the promoter controlling expression of the agronomically valuable trait 
or selecUon marker gene may be selected from the promoters preferred as exdsion 
promoters. 

111.6 Optional Methods of Preventing Premature Excision of the Excision Con- 

40 struct . . 

It is useful to have a system to maintain the dual-fundlon martoer compnsing construct 
of the invention espedally during Iransfonnation and seledion. In genera!, a control 
polynudeotide can be introduced into the ONA-construct encoding for the excision en- 
zyme to achieve this goal. The control polynudeotide generally functions either to in- 

45 hibit expression of the exdsion enzyme vwhen inhibition is desired (e. g., during trans- 
formation and selection; for preferred time frames see above) or to release repression 
of the exdsion promoter, thus allowing for expression from the exasion promoter. 
Those of skill will recognize that there are numerous variations for controlling or pre- 
venting expression of the excision enzyme in a particular cell or tissue or at a particular 

SO developmental stage. 
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In one aspect, expression from the first excision promoter (i. e. the promoter operabiy 
linked to the a first excision enzyme, which excises the dual-function marker) can be 
countered by a second. no-exdsion promoter. For example, the second no-excision 
promoter can be operabiy linked to a repressor gene, which, when expressed, prevents 
es^ression of the first excision promoter. Examples of repressor include the tet and 
lac repressors (Gatz, et al. (1991) Mol Gen Genet 227:229-237). The second no- 
exasion promoter is preferably a promoter which has the highest activity in the tissue 
used for transformation / selection but has low activity in the reproductive cell (s g 
pollen or oocyte), a precursor cell or tissue of said reproductive cell, or an omnlpoterit 
cell (e.g. zygote) resulting from reproduction. Also an inducible promoter can be em- 
ployed and induction is used during the transfomiation / selection phase. Such an In- 
ducible promoter can be for example a tetracycline (doxycycline) -Inducible system 
Which Is induced by tetracycline or doxycycline (see above). Antibiotics like this can be 
employed during transformation / selection. 

Alternatively, the second no-excision promoter can be linked to the polynucleotide en- 
coding the endonudease In the opposite orientation of the first exdsion promoter (i e 
f^om the 3'-end of the coding sequence towanJs the 5'-end of the sequence), thereby 
interrupting expression of the DMA cleaving enzyme. In these embodiments, the tran- 
scriptional acQvity of the second no-exdsion promoter prevents completion of tran- 
scnpts from the first excision promoter, thereby preventing expression of the exdsion 
enzyme. 



in other embodiments, an antisense polynudeotide or a polynudeotide producing a 
26 double-stranded RNA molecule can be operabiy linked to the second no-excision pro- 
moter, thereby preventing the translation of the DNA deaving enzyme mRNA. See. e 
g., Sheehy et al. (19BB) Proc Natl Acad Sd USA 85:8805-8809, and US 4,801 .34D*for 
a description of antisense technology; and EP-A1 1 042 4S2, EP-Al 1 068 311 for a 
description of the double-stranded RNA interference technique. The antisense or dou- 
30 ble-stranded RNA molecule should have homology to the nucleofide encoding the ex- 
cision enzyme to guarantee effident suppression. In general, antisense technology 
involves the generation of RNA transcripts that hybridize to a target transcript (i.e., the 
transcript encoding the sequence-specific endonudease). Alternatively, the second no- 
exdsion promoter can be operabiy linked to a DNA deaving enzyme polynudeotide in 
35 the sense orientation to induce sense suppression of the gene (see, e.g. Napoli et al 
(1990) Plant Cell 2:279-289, US S.034,323, US 5.231.020, and US 5,283.134 for a 
description of sense suppression technology). 

In some embodimente, aptamer technology can be used to repress expression of fte 
40 first ^cision promoter. See, e. g., Hemnann et al. (2000) Sdence 2a7(54S4):820-S; and 
Famulok et al. (1999) Curr Top Microbiol Immunol 243:123^6. For example, a small 
oligonudeotide could be developed that only binds and represses the first exdsion 
promoter when stabilized by a pari:icular chemical which can be applied vwhen trans- 
genic seed are desirad. For example, combinatorial library selections through the sys- 
45 tematic evolution of ligands by exponential enrichment (SELEX) technique can be used 
to identHy nudeic acid aptamers that bind with high affinity and specificity to a wide 
range of selected molecules. See. e. g., Conrad et al. (1995) Mol Divers 1(1)-69-78: 
and Kusser (2000) J Biotechnol 74(l):27-38. 
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In some embodiments, a multi-tiered excision system Is used. For example, the first 
excision promoter can be interrupted by a second recombination cassette. This second 
recombination cassette may again be flanked by a second set of homology sequences 
B and B' flanking a chemicaltyHlnduced promoter operabiy linked to a polynucleotide 

5 encoding a second sequence-spedfic DMA cleaving enzyme. In general, this system 
allows for the transgenic construct to remain intact in the genome (e.g., during trans- 
formation and selection) as long as the chemical inducer Is not provided. Once the 
chemical inducer is presented, the second DMA deaving enzyme is induced and ex- 
dses its own coding region, induces homologous recombination between B and B', 

10 thereby reconstituting the first excision promoter to an intact promoter. Since B remains 
after exdsion, B and B' are preferably a sub-sequence of said first exdsion promoter. 



IV. Additional elements in the DNA constructs of the invention 

The DNA construct may - beside the various promoter sequences - comprise addi- 

15 tlonal genetic control sequences. The term "genetic control sequences" is to be under- 
stood in the broad sense and refers to all those sequences which affect the making or 
function of the DNA construct to the invention or an expression cassette comprised 
therein. Preferably, an expression cassettes according to the invention encompass 5'- 
upstream of the respective nucleic acid sequence to be expressed a promoter and 3 - 

20 downstream a terminator sequence as additional genetic control sequence, and. if ap- 
propriate, further customary regulatory elements, in each case in operable linkage with 
the nudeic add sequence to be expressed. 

Genetic control sequences are described, for example, in "Goeddel; Gene Expression 
25 Technology: Methods in Enzymology 185, Academic Press. San Diego, CA (1990)" or 
"Gruber and Crosby, in: Methods in Plant Molecular Biology and Biotechnolgy, CRC 
Press, Boca Raton, Florida, eds.: Glick and Thompson, Chapter 7, 89-108" and the 
references cited therein. 



30 Examples of such control sequences are sequences to which inductors or repressors 
bind and thus regulate the expression of the nudeic acid. Genetic control sequences 
furthermore also encompass the 5'-untranslated region, introns or the non-coding 3'- 
region of genes. It has been demonstrated that they may play a significant role In the 
regulation of gene expression. Thus, it has been demonstrated that 5'Hintranslated 

35 sequences are capable of enhandng the transient expression of heterologous genes. 
Furthemiore, they may promote tissue spedfidty (Rouster J et aL (1998) Plant J 
15:435-440,)- Conversely, the S"-untranslated region of the Qpaque-2 gene suppresses 
expression. Deletion of the region In question leads to an increased gene activity 
(Lohmer S et ai. (1993) Plant Cell 5:65-73). Genetic control sequences may also en- 

40 compass ribosome binding sequences for initiating translation. This is prefemsd In par- 
ticular_when the nucleic add sequence to be expressed does not provide suitable se- 
quences or when they are not compatible with the expression system. 

The expression cassette can advantageously comprise one or more of what are known 
45 as enhancer sequences in operable linkage with the promoter, which enable the In- 
creased transgenic expnsssion of the nucleic add sequence. Additional advantageous 
sequences^ such as further regulatory elements or tenminators, may also be Inserted at 
the 3* end of the nucleic add sequences to be expressed recombinantly. One or more 
copies of the nucleic add sequences to be expressed recombinantly may be present In 
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the gene constnict Genetic control sequences are furthermore understood as meaning 
sequences which encode fiision proteins consisting of a s^nal peptide sequence. 

Polyadenylation signals which are suitable as genetic control sequences are plant 
5 polyadenylation signals, preferably those which comespond essentially to T-DNA 
polyadenylation signals from Agrobactarium tumefaciens. E)ompIes of particularty suit- 
able terminator sequences are the OCS (octopine synthase) temiinator and the NOS 
(nopallne synthase) tenminator. 

10 The DNA-constructs of the invention may encompass further nucleic acid sequences. 
Such nucleic add sequences may preferably constitute expression cassettes. Said 
ffurther sequences may include but shall not be limited to: 

i) Additional negative, positive or counter selection marker as described above. 

15 

ii) Report genes which encode readily quantifiable proteins and which, via intrinsic 
color or enzyme activity, ensure the assessment of the transformation efficacy or of 
the location or timing of expression. Very especially preferred here are genes 
encoding reporter proteins (see also Schenbom E, Groskreutz D. Mol Biotechnol. 

20 1 999; 13(1):29-44) such as 

- "green fluorescence protein'' (GFP) (Chui WL et aL, Curr Biol 1996, 6:325-330; 
Leffel SM et al., Biotechniques. 23(5):912-8. 1997; Sheen et al.(1995) Plant 
Journal B(S):777-7B4; Haseloff et aL(1997) Prac Natl Acad Sci USA g4(6):2122- 

25 2127; Reichel et aL(ig96) Proc Natl Acad Sci USA 93(12):5888-58g3: Tian et ai. 

(1997) Plant Cell Rep 16:267-271; WO 97/41228). 

- Chloramphenicol transferase, 

* 

30 - luciferase (Millar et al., Plant Mol Biol Rep 1992 10:324^14; Ow et aL (1986) 

Science, 234:856-859); permits the detection of bioiuminescence, 

" (^alactosidase, encodes an enzyme for which a variety of chromogenic sub- 
strates are available, 

35 

- (^glucuronidase (GUS) (Jefferson et aL, EMBO J. 1987, 6, 3901-3907) or the 
uidA gene, which encodes an enzyme for a variety of chromogenic substrates, 

- R locus gene product protein which regulates the production of anthocyanin 
40 pigments (red coloration) in plant tissue and thus makes possible the direct 

analysis of the promotor activity without the addition of additional adjuvants or 
chromogenic substrates (Deflaporta et aL, In: Chromosome Structure and Func- 
tion: Impact of New Concepts, 18th Stadler Genetics Symposium, 11:263-262. 
1988). 

45 

- B-lactamasa (Sutcliffe (1978) Proc Natl Acad Sci USA 75:3737-3741), enzyme for 
a variety of chromogenic substrates (for example PAOAC, a chromogenic cepha- 
losporin), 
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- xylE gene product (Zukowsky et al. (1983) Proc Natl Acad Scl USA 80:1101- 
1105). catechol dioxygenase capable of converting chromogenic catechols, 

- alpha-amylase (Ikuta et ai. (1990) Bio/iec^nol. 8:241-242), 

- tyrosinase (Kate et al.(1983) J Gene Microbiol 12912703-27T4), enzyme which 
oxidizes tyrosine to give DOPA and dopaquinone which subsequently form mela- 
nine, which is readily detectable. 

- aequorin (Prasher et ai.(1965) Biochem Biophys Res Commun 126(3): 1259- 
1268). can be used In the calcium-sensitive bioluminescence detection. 



10 



The DMA construct acconJIng to the invention and any vectors derived therefrann may 
comprise ftjrther functional elements. The term "further functional elements" is to be 
15 understood in the broad sense. It preferably refers to ail those elements wrhich affect 
the generation, multiplication, function, use or s/alue of said DIMA construct or vectors 
comprising said DMA construct, or cells or organisms comprising the before mentioned. 
These further functional elements may include but shall not be limited to: 

20 i) Origins of replication which ensure replication of the expression cassettes or vectors 
according to the invention In. for example, E. coli. Examples which may be men- 
tioned are ORI (origin of DIMA replication), the pBR322 on or the PISA ori (Sam- 
brook et al.: Molecular Cloning. A Laboratory Manual. 2™* ed. Cold Spring Harbor 
|_aboratory Press, Cold Spring Harbor, NY, 1989). 



il) Multiple cloning sites (MCS) to enable and facilitate the insertion of one or more 
nucleic acid sequences. 

ui) Sequences which make possible homologous recombination or insertion into the 
genome of a host organism. 

iv) Elements, for example border sequences, which make possible the Agrobactarium- 
mediated transfer in plant cells for the ti-ansfer and integration into the plant ge- 
nome, such as. for example, the right or left border of the T-DNA or the vir region. 

V. Construction of the DNA Constructs of the Invention 

Typically, constructs to be introduced into these cells are prepared using transgene 
expression techniques. Recombinant expression techniques involve the constmction of 
recombinant nucleic acids and the expression of genes in transfected celis. 

Molecular cloning techniques to achieve these ends are knoywi in the art. A wide vari- 
ety of cloning and in vitro amplification methods suitable for ttie construction of recom- 
binant nucleic adds are well-known to persons of skill. Examples of tiiese techniques 
and instructions sufficient to direct persons of skill tiirough many cloning exercises are 
found in Berger and KImmel, Guide to Molecular Cloning Techniques, Methods in En- 
zymology, Vol.1S2. Academic Press, hlc. San Diego, CA (Berger) ; T. Maniatis, E.F. 
Fritsch and J. Sambrook. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory. Cold Spring Harbor, NY (1989), in T.J. Silhavy, M.L Berman and 
L.W. Enquist. Experiments with Gene Fusions. Cold Spring Harbor Uboratory. Cold 
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Spring Harbor, NY (19B4) and Current Protocols In Molecular Biology, M. Ausubel et 
al., eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. 
and John Wiley & Sons, Inc., (199B Supplement), Preferably, the DMA constnjct ac- 
cording to the invention is generated by joining the abovementioned essential constitu- 
5 ents of the DNA construct together in the abovementioned sequence using the recom- 
bination and cloning techniques with which the skilled worker is familiar. 

Generally, a gene to be expressed will be present In an expression cassette, meaning 
that the gene is operably linked to expression control signals, e. g., promoters and ter- 

10 minators, that are functional in the host cell of interest The genes that encode the se- 
quence-specific DNA cleaving enzyme and, optionally, the selectable marker, will also 
be under the control of such signals that are functional In the host ceil. Control of ex- 
pression is most easily achieved by selection of a promoter. The transcription termina- 
tor is not generally as critical and a variety of known elements may be used so long as 

15 they are recognized by the cell The invention contemplates polynucleotides operably 
linked to a promoter in the sense or antisense orientation. 

A DNA construct of the invention (or a expression cassette or other nucleic acid em- 
ployed herein) is preferably introduced into cells using vectors into which these con- 
20 staicts or cassettes are inserted. Examples of vectors may be plasmids, cosmids, 
phages, viruses, retroviruses or else agrobacteria. 

The construction of polynucleotide constructs generally requires the use of vectors able 
to replicate In bacteria. A plethora of kits are commercially available for the purification 

2S of plasmids from bacteria. For their proper use, follow the manufacturer's instructions 
(see, for example, EasyPrep''^, FlexiPrep*^, both from Pharmacia Biotech; Stra- 
taClean'^, from Stratagene; and, QIAexpress^ Expression System, QIagen). The iso- 
lated and purified plasmids can then be further manipulated to produce other plasmids, 
used to transfect cells or incorporated into Agrobacterium tumefadens to infect and 

30 transform plants. Where Agrobacterium is the m^ns of transformation, shuttle vectors 
are constructed. 

However, an expression cassette (e.g., for an excision enzyme) may also be con- 
structed in such a way that the nucleic acid sequence to be expressed (for example 

35 one encoding an excision enzyme) is brought under the control of an endogenous ge- 
netic control element, for sample a promoter, for example by means of homologous 
recombination or else by random insertion. Such constructs are likewise understood as 
being expression cassettes for the purposes of the invention. The skilled worker fur- 
thermore knows that nucleic add molecules may also be expressed using artificial 

40 ' transcription factors of the zinc finger protein type (Beerii RR et al. (2000) Proc Natl 
Acad Sd USA a7(4):l495-500). These factors can be adapted to suit any sequence 
region and enable e)^ression independently of certain promoter sequences. 

VI. Target Organisms 

4S The methods of the invention are useful for obtaining marker-free plants, or cells, parts, 
tissues, harvested material derived therefrom. Accordingly, another subject matter of 
the invention relates to transgenic plants or plant cells comprising in their genome, 
preferably in their nudear. chromosomal DNA, the DNA constmct according to the In- 
vention, and to ceils, cell cultures, tissues, parts or propagation material - such as, for 
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example, in the case of plant organisms leaves, roots, seeds, fruit, pollen and the like - 
derived from such plants. 

The term "plant** includes whole plants, shoot vegetative organs/structures . {e. g. 
5 leaves, stems and tubers), roots, flov^ers and floral organs/structures (e. g. bracts, se* 
pals, petals, stamens, carpels, anthers and ovules), seeds (including embryo, en- 
dospemi, and seed coat) and fruits (the mature ovary), plant tissues (e. g. vascular 
tissue, ground tissue, and the like) and cells (e. g. guard cells, egg cells, trichomes and 
the like), and progeny of same. The class of plants that can be used in the method of 
10 the invention is generally as broad as the class of higher and lower plants amenable to 
transformation techniques, including angiosperms (monocotyledonous and dicotyledo- 
nous plants), gymnosperms, ferns, and multicellular algae. It includes plants of a vari- 
ety of ploidy levels, including aneuploid, polyploid, diploid, haploid and hemi2ygous. 

15 Included within the scope of the invention are all genera and species of higher and 
lower plants of the plant kingdom. Included are furthermore the mature plants, seed, 
shoots and seedlings, and parts, propagation material (for example seeds and fruit) 
and cultures, for example cell cultures, derived therefrom. 

20 preferred are plants and plant materials of the following plant families: Amaranthaceae, 
Brasslcaceae, Carophyllaceae, Chenopodiaceae, Compositae, Cucurbitaceae, Labi- 
atae, Legumlnosae, Papilionoideae, Llliaceae, Linaceae, Malvaceae. Rosaceae, Saxi- 
fragaceae, Scrophulariaceae, Solanaceae, Tetragoniaceae. 

25 Annual, perennial, monocotyledonous and dicotyledonous plants are preferred host 
organisms for the generation of transgenic plants. The use of the recombination sys- 
tem, or method according to the invention Is furi:hermore advantageous In all omamern 
tal plants, forestry, fruit, or ornamental trees, flowers, cut flowers, shrubs or turf. Said 
plant may include - but shall not be limited to - bryophytes such as, for example, 

30 Hepaticae (hepaticas) and Musci (mosses); pteridophytes such as ferns, horsetail and 
clubmosses; gymnosperms such as conifers, cycadSi ginkgo and Gnetaeae; algae 
such as Chlorophyceae, Phaeophpyceae, Rhodophyceae, Myxophyceae, Xanthophy- 
ceae, Bacitlariopiiyceae (diatoms) and Euglenophyceae. 

35 Plants for the purposes of the invention may comprise the families of the Rosaceae 
such as rose, Ericaceae such as rhododendrons and azaleas, Euphorbiaceae such as 
poinsettias and croton, Caryophyllaceae such as pinks, Solanaceae such as petunias. 
Gesneriaceae such as African violet Balsaminaceae such as touch-me-not. Orchida- 
ceae such as orchids, Iridaceae such as gladioli, iris, freesia and crocus, Compositae 

40 such as marigold, Geraniaceae such as geraniums, Uliaceae such as drachaena, 
Moraceae such as ficus, Araceae such as philodendron and many others. 

The transgenic plants according to the Invention are furthemnore selected in particular 
from among dicotyledonous crop plants such as. for example, from the families of the 

45 Leguminosae such as pea, alfalfa and soybean; the family of the Umbellrferae, particu- 
larty the genus Daucus (very particulariy the species carota (carrot)) and Apium (very 
particularly the species graveolens dulce (celery)) and many others; the family of the 
Solanaceae. particulariy the genus Lycopersicon, very particulariy the species esculen- | 
turn (tomato) and the genus Solanum, very particularly the spedes tuberosum (potato) 

50 and melongena (aubergine), tobacco and many others; and the genus Capsicum, very 
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particularly the species annum (pepper) and many others; the family of the Legumino- 
sae, particulariy the genus Glycine, very particularly the species max (soybean) and 
many others; and the family of the Cruciferae. particulariy the genus Brassica, very 
particularly the species napus (oilseed rape), campestris (beet), oleracea cv Tastle 
5 (cabbage), oleracea cv Snowball Y (cauliflower) and oleracea cv Emperor (broccoli): 
and the genus Arabidopsis, very particularly the species thaliana and many ottiers; the 
family of the Compositae, particularly the genus Lactuca, very particularly the species 
sativa (lettuce) and many others. 

10 The transgenic plants according to the invention are selected In particular among 
monocotyledonous crop plants, such as, for example, cereals such as wheat, bariey, 
sorghum and millet, rye, triticale. maize, rice or oats, and sugar cane. 



Further preferred are trees such as apple, pear, quince, plum, cherry, peach, nectarine, 
apricot, papaya, mango, and other woody species Including coniferous and deciduous 
trees such as poplar, pine, sequoia, cedar. oaK, etc. 

Especially preferred are Arabidopsis thaliana. Nicotiana tabacum, oilseed rape, soy- 
bean, com (maize), wheat, linseed, potato and tagetes. 

Plant varieties may be excluded, particularly registrable plant varieties according to 
Plant Breeders Rights. It is noted that a plant need not be considered a "plant variety" 
simply tjecause it contains stably within its genome a transgene, introduced into a cell 
of the plant or an ancestor thereof. 

In addition to a plant, the- present Invention provides any clone of such a plant, seed, 
selfed or hybrid progeny and descendants, and any part or propagule of any of these, 
such as cuttings and seed, which may be used in reproduction or propagation, sexual 
or asexual. Also encompassed by the invention is a plant which is a sexually or asexu- 
ally propagated off-spring, clone or descendant of such a plant, or any part or 
propagule of said plant, off- spring, clone or descendant. 

Plant organisms are furthennore. for the purposes of the invention, other organisms 
which are capable of photosynthetic activity, such as. for example, algae or cyanobac- 
35 teria, and also mosses. Prefenred algae are green algae, such as, for example, algae of 
the genus Haematococcus, Phaedactytum tricomatum, Volvox or Dunaliella. 

Genetically modified plants according to the invention which can be consumed by hu- 
mans or animals can also be used as food or feedstuffe, for example directly or follow- 
40 Ing processing known in the art 

VII. Methods for Introducing Constructs into Target Cells 

A DNA construct according to the invention may advantageously be introduced into 
cells using vectors Into which said DNA constaict is Inserted. Examples of vectors may 
45 be plasmids, cosmids, phages, viruses, retroviruses or agrobacteria. In an advanta- 
geous embodiment, the expression cassette is introduced by means of plasmid vec- 
tors. Prefen^d vectors are those which enable the stable integration of the .expression 
cassette into the host genome. 
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The DNA construct can be introduced into the target plant f:ells and/or organisms by 
any of the several means known to those of skill in the art, a procedure which is termed 
transformation Xsee also Keown et al. (1990) Meth Enzymol 185:527-537). Production 
of stable, fertile transgenic plants in almost all economically relevant monocot plants is 
S now routine:(Toriyama. et al. (1988) Bio/Technology 6:1072-1074; Zhang et al. (1988) 
Plant Cell Rep. 7:379-384; Zhang, et al. (1988) Theor AppI Genet 76:835-840: Shlma- 
moto et al. (1989) Nature 5338:274-276; Datta et al. (1990) Bio/Technology 8:736- 740; 
Christou et al. (1991) Bio/Technology 9:957-962; Peng, et al. (1991) international Rice 
Research Institute, Manila. Philippines 563-574; Cao et al. (1992) Plant Cell Rep 

10 1 1:585-591; LI et ai. (1993) Plant Cell Rep. 12:260-255; Rathore et al. (1993) Plant Mol 
Biol 21:871-864; Fromm et al. (1990) Bio/Technology 8:833-839; Gordon-Kamm et al. 
(1990) Plant Cell 2:603-618; D'l-ialluin et al. (1992) Plant Cell 4:1495-1505; Walters et 
al. (1992) Plant Mol Biol 18:189-200; Koziel et al. (1993) Biotechnology 11:194-200; 
Vasil IK (1994) Plant Mol Biol 25. 925-937; Weeks et al. 11993) Plant Physiology 102, 

15 1077-1084; Somers et al. (1992) BJo/Technology 10, 1689-1594; WO 92/14828). 

For instance, the DNA constructs can be introduced into cells, either in culture or in the 
organs of a plant by a variety of conventional techniques. For example, the DNA con- 
structs can be introduced directly to plant cells using ballistic methods, such as DNA 

20 particle bombardment, or the DNA construct can be introduced using techniques such 
as electroporation and microinjection of a cell. Particle-mediated transformation tech- 
niques (also known as "biolistics'*) are described in, e.g., Klein et aL (1987) Nature 
327:70-73; Vasil V et al. (1993) Bio/Technol 11:1553-1558; and Becker D et al. (1994) 
Plant J 5:299-307. These methods involve penetration of cells by small particles with 

25 the nucleic acid either within the matrix of small beads or particles, or on the surface. 
The biolistic PDS-1000 Gene Gun (Biorad, Hercules, CA) uses helium pressure to ac- 
celerate DNA-coated gold or tungsten microcam*ers toward target cells. The process is 
applicable to a wide range of tissues and cells from organisms, including plants. Other 
transformation methods are also known to those of skill in the art. 

Microinjection techniques are known in the art and are well described in the scientific 
and patent literature. Also, the cell can be permeabllized chemically, for example using 
polyethylene glycol, so that the DNA can enter the cell by diffusion. The DNA can also 
be introduced by protoplast fusion v\rith other DNA-containing units such as minicells, 
35 ceils, lysosomes or liposomes. The introduction of DNA constructs using polyethylene 
glycol (PEG) precipitation is described in Paszkowski et al. (1984) EMBO J 3:2717. 
Liposome-based gene delivery is e.g., described in WO 93/24640; Mannino and Gould- 
Fogerite (1988) BioTechmques 6(7):682-691; US 5,279,833; WO 91/06309; and Fei- 
gner et al- (1987) Proc Natl Acad Sci USA 84:7413-7414). 

Another suitable method of introducing DNA is electroporation, where the cells are 
permeabllized reversibly by an electrical pulse. Electroporation techniques are de- 
scribed in Fromm et al. (1985) Proc Natl Acad Sci USA 82:5824. PEG~mediated trans- 
formation and electroporation of plant protoplasts are also discussed in Lazzeri P 
45 (1995) Methods Mol Biol 49:95-106. Preferred general methods which may be men- 
tioned are the calcium-phosphate-mediated transfection, the DEAE-dextran-mediated 
transfection, the catlonic lipid-mediated transfection, electroporation, transduction and 
infection. Such methods are known to the skilled worker and described, for example, in 
Davis et al., Basic Methods In Molecular Biology (1986). For a review of gene transfer 
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methods for plant and cell cultures, see, Fisk ot al. (1993) Scientia Hortfculturae 55:5- 
36 and Potrykus (1990) CIBA Found Symp 154:198. 

Methods are known for introduction and expression of lieterologous genes in both 
5 monocot and dicot plants. See, e.g., US 5,633.446, US 5,317,096, US 5,689.062, US 
5,159,135, and US 5,679,558; Weising et ai. (1988) Ann. Rev. Genet. 22: 421-477. 
Transformation of monocots in particular can use various techniques including electro- 
poration (e.g., Shimamoto et al. (1992) Nature 338:274-276; blolistlcs (e.g., EP-A1 
270,356): and Agrobacterium (e.g., Bytebier et al. (1987) Proc Natl Acad Sci USA 
10 84:5345-5349). In particular, Agrobacterfum mediated transformation is now a highly 
efficient transfonnation method in monocots (Hlei et al. (1994) Plant J 6:271-282). As- 
pects of the invention provide an expression vector for use in such transformation 

methods which is a disarmed Agrobacterium Ti plasmid, and an Agrobacterium tume- 
faciens bacteria comprising such an expression vector. The generation of fertile trans- 
15 genie plants has been achieved using this approach in the cereals rice, maize, wheat, 
oat, and bariey (reviewed in Shimamoto K (1994) Current Opinion in Biotechnology 
5:158-162; Vasil et al. (1992) Bio/Technology 10:667-674; Vain et al. (1995) Biotech- 
nology Advances 13(4):653-671; Vasil (1996) Nature Biotechnology 14:702; Wan & 
Lemaux (1994) Plant Physiol. 104:37-48) 

20 

Other methods, such as microprojectile or particle bombardment (US 5,100,792, EP-A- 
444 882, EP-A-434 616). electroporation (EP-A 290 395, WO 87/06614), microinjection 
(WO 92/09696. WO 94/00583. EP-A 331 083, EP-A 175 966, Green et al. (1987) Plant 

Tissue and Cell Culture. Academic Press) direct DNA uptake (DE 4005152, WO 
25 90/12096, US 4,684.611), liposome mediated DNA uptake (e.g. Freeman et al. (1984) 
Plant Cell Physiol 2 9:1353), or the vortexing method (e.g., Kindle (1990) Proc Natl 
Acad Sci USA 87:1228) may be preferred where Agrobacterium transfonnatlon is inef- 
ficient or ineffective. 

30 In particular, transformation of gymnosperms, such as conifers, may be performed us- 
ing partide bombardment 20 techniques (Clapham D et al. (2000) Scan J For Res 15: 
151-160). Physical methods for the transformation of plant cells are reviewed in Oard. 
(1991) Biotecrfi. Adv. 9 :1-11. Alternatively, a combination of different techniques may 
be employed to enhance the efficiency of the transformation process, e.g. bombard- 

35 ment with Agrobacterium coated microparticles (EP-A-486234) or microprojectile bom- 
bardment to induce wounding followed by co-cultivation with Agrobacterium (EP-A- 
486233) . 

In plants, methods for transforming and regenerating plants from plant tissues or plant 
40 cells with which the skilled woricer is familiar are exploited for trarisient or stable trans- 
formation. Suitable methods are espedally protoplast transformation by means of poly- 
ethylene-glycol-induced DNA uptake, biolistic methods such as the gene gun ("partide 
bombardment" method), electroporation, the incubation of dry embryos in DNA- 
containing solution, sonicatlon and microinjection, and the transfonnafion of Intact cells 
45 or tissues by micro- or macroinjection into tissues or embryos, tissue electroporation. or 
vacuum infiltration of seeds, in the case of injection or electroporation of DNA into plant 
ceils, the plasmid used does not need to meet any particular requirement. Simple 
plasmids such as those of the pUC series may be used. If intact plants are to be re- 
generated fi-om the transformed cells, the presence of an additional selectable martcer 
50 gene on the plasmid is useful. 
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In addition to these "direct" transformation techniques, transformation can also be ear- 
ned out by bacterial infection by means of Agrobacterium tumefaciens or Agrobacte- 
rium rhizogenes. These strains contain a plasmid (Tl or Ri plasmid). Part of this plas- 
mid, termed T-DNA (transferred DNA), is transfen-ed to the plant following agrobacterial 
5 infection and integrated into the genome of the plant cell. 

For Agrobacterium-mediated transformation of plants, the DNA construct of the inven- 
tion may be combined with suitable T-DNA flanking regions and introduced into a con- 
ventional Agrobacterium tumefaciens host vector. The virulence functions of the A. 

10 tumefaciens host will direct the insertion of a transgene and adjacent marker gene(s} (if 
present) into the plant cell DNA when the cell is Infected by the bacteria. Agrobacterium 
tumefaciens-mediated transformation techniques are well described in the scientific 
literature. See. for example. Horsch et al. (1984) Science 233:496-498, Fraley et al. 
(1983) Proc Natl Acad Sci USA 80:4803-4807, Hooykaas (1989) Plant Mol Biol 13:327- 

15 335, Horsch RB (1986) Proc NatJ Acad Sci USA 83(8):2571-2575), Bevans et aL 
(1983) Nature 304:184-187, Bechtold et al. (1993) Comptes Rendus De L'AcademIe 
Des Sciences Serie Ill-Sciences De La Vie-Life Sciences 316:1194-1199, Vaivekens et 
al. (1988) Proc Natl Acad Sci USA 85:5536-5540. 

20 The DNA construct is preferably integrated into specific plasmids, either into a shuttle, 
or intermediate, vector or into a binary vector). If, for example, a T\ or Ri plasmid is to 
be used for the transformation, at least the right border, but in most cases the right and 
the left border, of the Ti or Ri plasmid T-DNA is linked with the expression cassette to 
be introduced as a flanking region. Binary vectors are preferably used. Binary vectors 

25 are capable of replication both in coli and in Agrobacterium. As a rule, they contain a 
selection marker gene and a linker or polylinker flanked by the right or left T-DNA flank- 
ing sequence. They can be transformed directly into Agrobacterium (Holsters et al. 
(1978) Mol Gen Genet 163:181-187). The selection maricer gene pennits the selection 
of transformed agrobacteria and is, for example, the DAAO gene of the Invention, 

30 which imparts resistance to D-amino acids like D-alanine (see above). The Agrobacte- 
rium, v^ich acts as host organism in this case, should already contain a plasmid with 
the vir region. The latter is required for transfemng the T-DNA to the plant cell. An 
Agrobacterium thus transformed can be used for transfomiing plant cells. 

35 Many strains of Agrobacterium tumefaciens are capable of transferring genetic material 
• for example the DNA construct according to the invention % such as, for example, the 
strains EHA101[pEHA101] (Hood EE et al. (1996) J Bacterid 168(3): 1291 -1301), 
EHA105[pEHA105] (Hood et al. 1993, Transgenic Research 2, 208-218), 
LBA4404[pAL4404] (Hoekema et aK (1983) Nature 303:179-181), C58Cl[pMP90] 

40 (Koncz and Schell (1986) Mol Gen Genet 204,383-396) and C58C1[pGV2260] (De- 
biaere et al. (1985) Nucl Acids Res, 13, 4777-4788). 

The agrobacterial strain employed for the transformation comprises, in addition to its 
disarmed Tl plasmid, a binary plasmid with the T-DNA to be transferred^ which, as a 

45 rule, comprises a gene for the selection of the transformed cells and the gene to be 
transferred. Both genes must be equipped with transcriptional and translational initia- 
tion and termination signals. The binary plasmid can be transferred into the agrobacte- 
rial. strain, for example by electroporation or other transformation methods (Mozo & 
Hooykaas (1991) Plant Mol Biol 16:917-918). Co-culture of the plant explants with the 

50 • agrobacterial strain is usually performed for two to three days. 
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A variety of vectors could, or can, be used. In principle, one differentiates between 
those vectors which can be employed for the agrobacterium-mediated transformation 
or agroinfection, i.e. which comprise the DNA construct of the invention within a T- 
DNA. which indeed permits stable integration of the T-DIMA into the plant genome. 
5 Moreover, border-sequence-free vectors may be employed, which can be transfomied 
into the plant cells for example by particle bombardment, where they can lead both to 
transient and to stable expression. 

The use of T-DNA for the transfomiation of plant cells has been studied and described 
10 intensively (EP-A1 120 516; Hoekema, In: The Binary Plant Vector System, Offset- 
daikkerij Kanters B. V., Alblasserdam, Chapter V; Fraley et aL (1985) Crit Rev Plant 
Sci 4:1-45 and An et aL (1985) EMBO J 4:277-287). Various binary vectors are known, 
some of which are commercially available such as, for example, pBlN19 (Clontech 
Laboratories, Inc. USA). 

To transfer the DNA to the plant cell, plant explants are cocultured with Agrobacterium 
tumefaciens or Agrobacterium rhizogenes. Starting from infected plant material (for 
example leaf, root or stalk sections, but also protoplasts or suspensions of plant cells), 
Intact plants can be regenerated using a suitable medium which may contain, for ex- 
20 ample, antibiotics or biocides for selecting transformed cells. The plants obtained can 
then be screened in the presence of the DNA introduced, in this case the DNA con- 
struct according to the invention. As soon as the DNA has integrated into the host ge- 
* nome, the genotype in question Is, as a rule, stable and the insertiion in question is also 
found in the subsequent generations. Preferably the stably transformed plant is se- 
25 lected using the method of the invention (however other selection schemes employing 
other selection, mariners comprised in the DNA construct of the Invention may be used). 
The plants obtained can be cultured and hybridized in the customary fashion. Two or 
more generations should be grown in order to ensure that the genomic integration is 
stable and hereditary. 

The abovementloned methods are described, for example, in B. Jenes et aL, Tech- 
niques for Gene Transfer, in: Transgenic Plants, Vol. 1. Engineering and Utilization, 
edited by SD Kung and R Wu. Academic Press (1993), 128-143 and In Potrykus (1991) 
Annu Rev Plant Physiol Plant Molec Biol 42:205-225). The construct to be expressed is 
35 preferably cloned into a vector which is suitable for the transfomiation of Agrobacterium 
tumefaciens, for example pBinl9 (Sevan et al. (1984) NucI Acids Res 12:8711). 

The DNA construct of the invention can be used to confer desired trails on essentially 
any plant One of skill will recognize that after DNA constmct is stably incorporated in 
40 transgenic plants and confirmed to be operable, it can be introduced into other plants 
by sexual crossing. Any of a number of standard breeding techniques can be used, 
depending upon the species to be crossed. 

VIII. Regeneration of Transgenic Plants 
45 Transfonned cells, le, those which comprise the DNA integrated into the DNA of the 
host cell, can be selected from untransformed cells preferably using the selection 
method of the Invention. As soon as a transformed plant cell has been generated, an 
Intact plant can be obtained using methods known to the skilled worker For example, 
callus cultures are used as starting material. The fonination of shoot and root can be 
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induced in this as yet undifferentiated cell biomass in the known fashion. The shoots 
obtained can be planted and cultured. 

Transfonned plant cells, derived by any of the above transfonnation techniques, can be 
S cultured to regenerate a whole plant which possesses the transformed genotype and 
thus the desired phenotype. Such regeneration techniques rely on manipulation of cer- 
tain phytohormanes in a tissue culture growth medium, typically relying on a biodde 
and/or herbicide marker that has been introduced together with the desired nucleotide 
sequences. Plant regeneration from cultured protoplasts is descnbed In Evans et ai., 

10 Protoplasts Isolation and Culture, IHandbook of Plant Ceil Culture, pp. 124176, Macmil- 
lian Publishing Company, New York (1983); and in Binding, Regeneration of Plants, 
Plant Protoplasts^ pp. 21-73, CRC Press, Boca Raton (1985). Regeneration can also 
be obtained from plant callus, explants, somatic embryos (Dandekar et al. (1989) J 
Tissue Cult Meth 12:145; McGranahan et aL (1990) Plant Cell Rep 8:512), organs, or 

15 parts thereof. Such regeneration techniques are described generally in Kiee et al. 
(1 987) Ann Rev Plant Physiol 38:467-486. Ottier available regeneration techniques are 
reviewed in Vasil et al., Cell Culture and Somatic Cell Genetics of Plants , Vol 1. II, and 
III, Laboratory Procedures and Their Applications, Academic Press, 19B4, and Weiss- 
bach and Weissbach, Methods for Plant Molecular Biology, Academic Press, 1989. 

20 

IX. Generation of descendants 

After transformation, selection and regeneration of a transgenic plant (comprising the 
DMA construct of the invention) descendants are generated, which — because of the 
activiiy of the excision promoter underwent excision and do not comprise the marker 
25 sequence(5) and expression cassette for the endonucleaae. 

Descendants can be generated by sexual or non-sexual propagation. Non-sexual 
propagation can be realized by introduction of somatic embryogenesrs by techniques 
well known in the art. Preferably, descendants are generated by sexual propagation / 
30 fertilization. Fertilization can be realized either by selfing (setf-polfination) or crossing 
with other transgenic or non-transgenic plants. The transgenic plant of the invention 
can herein function either as maternal or patemal plant. 

* 

After the fertilization process, seeds are harvested, germinated and grown into mature 
35 plants. Isolation and identification of descendant which underwent the excision proc- 
ess can be done at any stage of plant development. Methods for said identification are 
well known in the art and may comprise - for example - PCR analysis. Northern blot, 
Southern blot, or phenotypic screening (e.g., for an negative selection marker). 

40 Descendants may comprise one or more copies of the agronomically valuable trait 
gene. Preferably, descendants are isolated which only comprise one copy of said trait 
gene. It is another inventive feature of the present invention that multiple insertion (e.g., 
of a T-DNIA) in one genomic location will be reduced to a single insertion event by exci- 
sion of the redundant copies (Fig. 10). 

45 

In a preferred embodiment the transgenic plant made by the process of the invention is 
marker-free. The tenns "marker-^free" or "selection marker free** as used herein with 
respect to a cell or an organisms are intended to mean a cell or an organism which is 
not able to express a functional selection marker protein (encoded by expression cas- 
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sette b; as defined above) which was Inserted into said cell or organism in combination 
with the gene encoding for the agronomically valuable trait. The sequence encoding 
said selection marker protein may be absent in part or preferably - entirely. Further- 
more the promoter operably linked thereto may be dysfunctional by being absent In 
5 part or entirely. 

The resulting plant may however comprise other sequences which may function as a 
selection marker. For example the plant may comprise as a agronomically valuable trait 
a herbicide resistance conferring gene. However, it is most preferred that the resulting 
10 plant does not comprise any selection marker. 

Also In accordance with the invention are cells, cell cultures, parts - such as. for exam- 
ple, in the case of transgenic plant organisms, roots, leaves and the like - derived from 
the above-described transgenic organisms, and transgenic propagation material (such 
15 as seeds or fruits). 

Genetically modified plants according to the invention which can be consumed by hu- 
mans or animals can also be used as food or feedstuffs, for example directly or follow- 
ing processing known per se. Here, the deletion of, for example, resistances to antibiot- 
20 ICS and/or herbicides, as are frequently Introduced when generating the transgenic 
plants, makes sense for reasons of customer acceptance, but also product safety. 

A further subject matter of the invention relates to the use of the above-described 
transgenic organisms according to the invention and the cells, cell cultures, parts - 
25 such as, for example, in the case of transgenic plant organisms, roots, leaves and the 
like - derived from them, and transgenic propagation material such as seeds or fruits, 
for the production of food or feedstufis, pharmaceuticals or fine chemicals. Here again, 
the deletion of, for example, resistances to antibiotics and/or herbicides is advanta- 
geous for reasons of customer acceptance, but also product safety. 

30 

Fine chemicals is understood as meaning enzymes, vitamins, amino acids, sugars, 
fatty acids, natural and synthetic flavors, aromas and colorants. Especially preferred is 
the production of tocopherols and tocotrienols, and of carotenoids. Culturing the trans- 
formed host organisms, and isolation from the host organisms or from the culture me- 
35 dium. is performed by methods known to the skilled worker. The production of phanria- 
ceuticals such as, for example, antibodies or vaccines, is described by Hood EE, Jilka 
JM. (1999) Cun- Opin BiotechnoL 10(4):3B2-386; Ma JK and Vine ND (1999) Cun- Top 
Microbiol lmmunoL236:275-92). 

40 Various further aspects and embodiments of the present invention will be apparent to 
those skilled In the art in view of the present disclosure. All documents mentioned In 
this specification are incorporated herein in their entirety by reference. Certain aspects 
and embodiments of the invention will now be Illustrated by way of example and with 
reference to the figure described below. 
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X. Sequences 

1.SEQ ID NO:1: 



2. SEQ ID NO:2: 



Nucleic add sequence encoding D-amino acid oxidase from 
Rhodosporidium toruloides (Yeast) 

Amino acid sequence encoding D>amlno acid oxidase from 
Rhodosporidium toruloides (Yeast) 



3. SEQ ID NO:3; Nucleic acid sequence encoding D-amino add oxidase from 
10 Caenorhabditis elegans 

4. SEQ ID NO:4: Amino add sequence encoding D-amIno acid oxidase from Cae- 

norhabditis elegans 

15 5. SEQ ID NO:5: Nucleic acid sequence encoding D-amlno add oxidase from Nec- 

tria haematococca 



20 



6. SEQ ID NO:6: Amino add sequence encoding D-amino acid oxidase from Nec- 

tria haematococca 

7. SEQ ID NO:7: Nucleic acid sequence encoding D-amino add oxidase from 

Trigonopsis variabilis 

8. SEQ ID NO:8: Amino add sequence encoding D-amino add oxidase from Tri- 
gonopsis variabilis 

9- SEQ ID NO:9: Nucleic add sequence encoding D-amIno add oxidase from 

Schizosaccharomyces pombe (fission yeast) 

30 10. SEQ ID NO: 10; Amino acid sequence encoding D-amino acid oxidase from 

Schizosaccharomyces pombe (fission yeast) 



25 



11. SEQ ID NO:11: 



Nudeic acid sequence encoding D-amino acid oxidase from 
Streptomyces coelicolor A3(2) 



35 



12. SEQ ID NO:12: Amino add sequence encoding D-amino acid oxidase from 

Streptomyces coelicolor A3(2) 



13. SEQID NO:13: Nucleic add sequence encoding D-amino add o^odase from 
40 Candida boidinii 

14. SEQ ID NO:14: Amino acid sequence encoding D-amino acid oxidase from Can- 

dida boidinii 



45 
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IS. SEQ ID NO: 15: Nucleic acid sequence encoding vector daaoScelTetON (length: 





12466 bp) 








Feature 


Poaftion (Base No,) 


Orientation 


5 


LB (Left Border) 


7618-7B34 


direct 




35SpA (35S terminator) 


7345 - 7549 


complement 


* 


ptxA promoter 


6479 - 7341 


complement 




rtTA (reverse tetracycline 








transcription transactivator ) 


5416 - 6425 


complement 


10 


OCS-T (OCS terminator) 


5118-5343 


complement 




nitlP(Nrt1 promoter) 


3217-5023 


complement 




daao (D-amino acid oxidase) 


2067-3173 


complement 




nosT (nos terminator) 


1735 - 1990 


complement 




pTOPlOP (let regulated pramoter) 1270 - 1660 


complement 


15 


l&cel (l-Scel endonuctease) 


515-1222 


complement 




l-Sce recognition/cleavage site 


445 - 462 


direct 




3SSpA (35S terminator) 


196-400 


complement 




RB (right border) 


38-183 


direct 



20 16. SEQ ID NO:16: 



25 



30 



Nucleic acid sequence encoding vector daaoNit-PRecombinatlon 
(length: 12539 bp) 



Feature 

LB (Left Border) 

STPT (sTPT promoter) 

GUS (GUS gene) 

35SpA (358 terminator) 

Niti P (NItl promoter) 

daao (D-amlno acid 0)ddase) 

nosT (nos terminator) 

l-^ce recognition/cleavage site 

35SpA (35S terminator) 

RB (right border) 



PofiKion (Base No.) 


Orientation 


7691 - 7907 


direct 


7619-6302 


complement 


6248 - 4251 


complement 


4176 - 3972 


complement 


3862 - 2071 


complement 


2027 - 921 


complement 


644 569 


complement 


445 - 462 


direct 


400- 196 


complement 


38-183 


direct 
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XI. BRIEF DESCiUPTiON OF THE FIGURES 

Fig. 1: Basic Principle of the dual-function selection marker 

A: A mixture population consisting of wild-type, non-transgenic plants (gray 
color) and transgenic plants comprising the dual-function marker (black color) 
is treated with either D-alanine or D-isoleudne. While the toxic effect of D- 
alanine on non-transgenic plants is detoxified by the transgene-mediated con- 
version (thereby selectively killing the wild-type plantlets), the non-toxic D- 
isoleudne is converted by the same enzymatic mechanism into a phytotoxic 
compound (thereby selectively killing the transgenic plantiets). 

B: The dual-function of the marker can be employed subsequentiy for con- 
struction of marker-free transgenic plants. While the function as a negative se- 
lection marker is utilized to allow for insertion of a transgene comprising a 
gene of interest (GOI) into a wild-type plant (gray color), the counter-selection- 
function Is employed to subsequently delete the selection marker by combin- 
ing marker-deletion technology and counter-selection (thereby killing the dual- 
function marker comprising plantiets (blade-color)) resulting in plantiets com- 
prising the GOI but lacking the dual function marker (gray hatching). 



Rg. 2: Wild-type Arabidopsis thaliana plantiets (left side) and transgenic plantiets 
20 comprising tiie dual function marker (DAAO gene from Rhodotorula gracilis) 

are treated with either 30 mM D-isoleucine (upper side) or 30 mM D- alanine 
(bottom side). A to)dc effect of D-isoleucine on the transgenic plants and D- 
alanine on the wild-type plants, respectively, can be observed, while no severe 
damage can be detected on the respective other group, thereby allowing for 
25 clear distinguishing and easy selection of either transgenic or wild-type plants. 



3 Effect of various D-amino acids on plant growth. 

Wild type Arabidopsis thaliana plantiets were grown on half-concentrated Mu- 
rashige-Skoog medium (0,5% (wt/vol) sucrose, 0.8% (wt/vol) agar) supple- 
30 mented with the indicated D-amino acid at either 3 mM (Panel A) or 30 mM 

(Panel B). While D-alanine and D-serine are imposing severe phytotoxic ef- 
fects even at 3 mM concentrations no significant effects can be observed for 
D-isoleucine. 

35 Rg. 4 D-amino acid dose responses of daol transgenic and wild-t/pe A. thaliana. 

(a-d) Growth of daol transgenic line 3:7 (white), 10:7 (light gray), 13:4 (gray) 
and vMld-type (black) plants, in fresh weight per plant, on media containing 
various concentrations of D-serine, D-alanine, D-tsoIeucine and D-valine in 
haif-strength MS with 0.5% (wt/vol) sucrose and 0.8% (wtA/ol) agar. Different 
40 concentration ranges were used for different D-amino acids. The plants were 

grown for 10 d after germination under 16 h photoperiods at 24 ^'C; n = 10 ± 
s.e.m., except for plants grown on D-isoleucine, where smaller Petri dishes 
were used, (n 6 ± s.e.m.). 

45 (e-l) Photographs of daol transgenic line 10:7 (e-h) and wild-type plants (M), 

grown for 10 d on the highest concentrations of the D-amino acid shown in the 
respective graphs above. Ail pictures have the same magnification. FW, fresh 
weight 
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Selection of primary transformants with the DAAO marker 

(a-c) DAAO T1 seedlings on media containing 3 mWI D-alanine (a), 3 mM D- 
serine (b) and SO pQ nnl-1 kanamycin (c). Seeds were surface-sterilized and 
sown on half-strength MS plates with 0.5% (wtA^ol) sucrose, O.B% (wt/vol) 
agar and the respective selective compound, then grown for 5 d after germina- 
tion under 16 h photoperiods at 24 ''C. 

(d) DAAO transgenic plants grown on soil photographed after selection by 
spraying with (1) D-alanine and (2) D-serine, and wild-type plants sprayed with 
10 (3) D-alanine and (4) D-serine. Eight seeds per plot and treatment were sown 

on soil, and grown for 7 d after germination before applying the selective 
treatment, which consisted of spraying with aqueous 50 mM solutions of D- 
alanine or D-serine with 0.05% Tween 80 on three consecutive days. 

15 (e) Northem blot analysis of daol mRNA levels from sbc D-serine- and D- 

alanine-resistant lines and wild-type plants. Ten pg total RNA was loaded per 
lane and separated on an agarose gel. Ethidium bromide-stained total RNA 
bands are shown as loading controls, (f) DAAO activity in six transgenic lines 
and wild type. A unit of DAAO activity is defined as the turnover of one mlcro- 

20 mole of substrate per minute. Bars represent means ± s.e.m., /i = 3. 

Fig. 6-^7 Demonstration of broad applicability of the selection system. D-serine is im- 
posing toxic effects on a variety of different plant species both monocotyle- 
donous and dicotyledonous plants. Effects are demonstrated for popular (Fig- 
25 6A), bariey (Fig. 6B), tomato (Fig. 6C), tobacco (Fig. 7A). Arabidopsis thaliana 

(Fig. 7B), and Com (Zea mays, Fig. 7C). Similar effects are obtained when us- 
ing D-alanine instead of D-serine. 

Rg. B-1 D: Preferred constructs of the invention 

The following abbreviations apply to Vtia figures In general: 

A: Sequence A allowing for sequence deletion (e.g., recognition site for 

recombinase or homology sequence) 
A': Sequence A* allowing for sequence deletion (e.g., recognition site for re- 
combinase or homology sequence) 
A/A: Sequence as the result of (homologous) recombination of A and A 
DAAO: Sequence encoding a diamine acid oxidase having dual-function marker ac- 
tivity. 

EN: Sequence encoding sequence specific DNA-endonuclease 
Trail: Sequence coding for e.g., agronomically valuable trait 
Pr,: promoter 

RSn: Recognition sequence for the site-directed induction of DIsIA double-strand 
breaks (e.g., S1: First recognition sequence). The recognition sequences 
may be different (e,g., functioning for different endonudeases) or -preferably 
- identical (but only placed in different locations). 
Rn or Sn: Part of recognition sequence RS„ remaining after deavage 
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Fig.: S Preferred basic instruct and method 

A vector comprising the DMA construct (preferably a circular Agrobacterium 
binary vector) is employed comprising: A first expression cassette for the dual- 
function marker (DAAO) under control of a promoter functional in plants (PI) 
5 and a second esqi^ression cassette for an agronomically valuable trait (TRAIT) 

also under control of (preferably a different) promoter functional in plants (P2), 
The first expression cassette is flanked by sequences which allow for specific 
deletion of said first expression cassette (A and A'). A and A' may be se- 
quences for a sequence-specific recombinase or sequences which allow for 
1 0 homologous recombination between each other For the later alternative, two 

identical sequences can be arranged in form of a directed repeat 

The DMA construct is Inserted into plant cells (1.) and selection is performed 
making use of the negative selection function of the dual function marker (2.) 

15 e.g., employing D-alanine or D-serine. Thereby plant cells or plants are se- 

lected comprising the DMA construct. Based on said plant cells or plants dele- 
tion of the first expression cassette is initiated (3.) and selection is performed 
making use of the counter-selection function of the dual function marker (4.) 
e.g., employing D-isoleucine or D-^valine. Thereby plant cells or plants are se- 

20 lected comprising the second expression cassette but lacking the first expres- 

sion cassette. 

Fig.: 8 Construct mediating marker excision via induced homologous recombination 

The DMA construct introduced into the plant genome by utilizing the negative 
selection marker function of the dual-function marker is comprising: A first ex- 
pression cassette for the dual-function marker (DAAO) under control of a pro- 
moter functional in plants (P1) and a second expression cassette for an 
agronomically valuable trait (TRAIT) also under control of (preferably a differ- 
ent) promoter functional in piante (P2). The first expression cassette is flanked 
by homology sequences A and A' which allow for homologous recombination 
between each other, being arranged in form of a directed repeat. Within the 
DMA construct there is at least one (preferably - as depicted here - two) reo- 
ognition sequences (RS) (cleavage sites) for a sequence specific endonucle- 
ase (RSi, RSa). The two sequences may be different (i.e., for different en- 
donucleases) or — preferably - identical Cleavage at said recognition se- 
quences (RSi and R&z) is initiated by the corresponding endonucleasa (1 .) re-- 
suiting in double-strand breaks, which are ''repaired*' by homologous recombi- 
nation between the homologous end-sequences (probably supported by the 
cellular DMA repair mechanism). The resulting genome still comprises the 
second expression cassette for the trait but lacks the first expression cassette 
for the dual-function marker. Selection is performed making use of the 
counter-selection function of the dual function marker (2.) e.g., employing D- 
isoleucine or D-valine. Thereby plant cells or plants are selected comprising 
the second expression cassette but lacking the first expression cassette. 
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Fig.: 10 Constntct mediating marker excision via induced homologous recombination 

In an preferred embodiment the DNA constaict introduced into the plant ge- 
nome further comprises a third expression cassette for the sequence specific 
endonudease (or if recombinases are utilized for the recomblnase). The first 
expression cassette (for the dual-function marker) and the third expression 
cassette (for the endonudease) are together flanked by homology sequences 
A and A' which allow for homologous recombination between each other, be^ 
ing arranged In fomi of a directed repeat Within the DNA construct there is at 
least one (preferably - as depicted here - two) recognition sequences (RS) 
(cleavage sites) for a sequence specific endonudease (RSi, RSz). The two 
sequences may be different (I.e., for different endonudeases) or — preferably-- 
identical. Expression of the corresponding endonudease from the third ex- 
pression cassette is initiated (1.)i resulting in deavage at said recognition se- 
quences (RSi and RS^) thereby forming in double-strand breaks (2,), which 
are "repaired" by homologous recombination between the homologous end- 
sequences (probably supported by the cellular DNA repair mechanism). The 
resulting genome still comprises the second expression cassette for the trait 
but lacks tha first and third expression cassette. Selection is performed mak- 
ing use of the counter-selection function of the dual function marker (3.) e.g., 
employing D-isoleucine or D-valine. Thereby plant cells or plants are selected 
comprising the second expression cassette but lacking the first and third ex- 
pression cassette. 

Preferably the expression of the endonudease is controllable e.g., by employ- 
ing an inducible promoter (see below for details). 

Fig.: 11 D- amino adds are applicable by spraying procedure 

DAAO transgenic plants grown on soil photographed after selection by spray- 
ing with (1) D-alanine and (2) D-serine, and wild-type plants sprayed with (3) 
D-alanlne and (4) D-serine. Eight seeds per plot and treatment were sown on 
soil, and grown for 7 d after germination before applying the seledive treat- 
ment, which consisted of spraying with aqueous 50 mM solutions of D-alanine 
or D-serine with 0,05% Tween 80 on three consecutive days. 
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1g.: 12 Alignment of the catalytic site of various D-amino add oxidases 

Multiple alignment of the catalytic site of various D-amino add oxidases allows 
for determination of a characteristic sequence motif [LIVMHLI\^]-HMNHA]- 
Y-G-x-[GSAHGSA]-x-G-X5-G-x-A, which allows for easy identification of addi- 
tional D-amino acid oxidases suitable to be employed within the method and 
DNA-constructs of the invention. 
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Fig.: 13 Vector map of construct daaoScelTetOn (Seq ID NO: 15) (length: 12466 bp) 
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Abbreviation 


Feature PosMon (Base No.) 


Orientation 


LB 


Left Border 


7618-7834 


direct 


35SpA 


35S tenminator 


7345 - 7549 


complement 


ptxA promater 




6479 - 7341 


complement 


rtiA 


Tet repressor 


5418 - 6425 


complement 


OCS-T 


OCS terminator 


5118-5343 


complement 


nit1P 


Nitl promoter 


3217-5028 


complement 


daao 


D-amino add oxidase 


20S7-3173 


complement 


nosT 


nos terminator 


1735-1990 


vuiii|j|Briicni 


pTOPlOP 


tet regulated promoter 


1270 - 1660 


complement 


ISecl 


t»Secf endonuclease 


515-1222 


complement 


I^Sce recognition/cleavage site 


445-462 


direct 


35SpA 


35S terminator 


196-400 


complement 


RB 


right border 


38-183 


direct 


CoIEl 


ColEI origin of replication (E.coli) 




aadA 


Spectomyoln/Strepatomycin resistance 




repA/pVS1 


repA origin of replication (Agrobacterium) 





Furthermore, important restriction sites are indicated with their respective cutting posi- 
tion. 



25 ° 12539 bp) 
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Abbreviation 


Feature 


Position (Bas( 


^No.) 


Orientation 


LB 


Left Border 




7691 


-7907 


direct 


STPT 


sTPT promoter 




7619 


-6302 


complement 


GUS 


GUS gene 




6248 


-4251 


complement 


35SpA 


3SS terminator 




4176 


-3972 


complement 


NitIP 


Nitl promoter 




3882 


-2071 


complement 


daao 


D*amlno acid o)ddase 


2027 


-921 


complement 


nosT 


nos terminator 




844- 


589 


complement 


i-Sce recognition/cleavage site 




445- 


•462 


direct 


SSSpA 


35B terminator 




400- 


196 


complement 


RB 


right border 




38-1B3 


direct 



40 



45 



ColEI 
aadA 
repA/pVS1 



ColEI origin of replication (E.coli) 
Spectomycin/Strepotomycin resistance 
repA origin of replication (Agrobacterium) 



Furthermore, Important restriction sites are indicated with their respective cutting posi- 
tion. The GUS gene is comrpising an intron (int). 
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Xli. Examples 



General methods: 

The chemical synthesis of oligonucleotides can be effected for example in the known 
5 manner using the phosphoamidite method (Voet, Voet, 2nel edition. Wiley Press 
New Yoric. pages 896-897). The cloning steps cam'ed out for the purposes of the pre- 
sent Invention, such as, for example, restriction cleavages, agarose gel electrophore- 
sis, purification of DNA fiagments. the transfer of nucleic acids to nitrocellulose and 
nylon membranes, the linkage of DNA fragments, the transformation of E. coli cells, 

10 bacterial cultures, the propagation of phages and the sequence analysis of recombi- 
nant DNA are carried out as described by Sambrook et al. (1 989) Cold Spring Harbor 
Laboratory Press; ISBN 0-87969-309-6. Recombinant DNA molecules were sequenced 
using an ALF Express laser fluorescence DNA sequencer (Phamiacia, Sweden) follow- 
ing the method of Sanger (Sanger et al., Proc, Natl. Acad. Sci. USA 74 (1977) 5463- 

15 5467). " 

Example 1: Vector construction and plant tfansformation 

DNA and RNA manipulation were done using standard techniques. 

20 The yeast R. graces was grown In liquid culture containing 30 mM D-alanine to induce 
daol. the gene encoding DAAO. Total RNA was isolated from the yeast and used for 
cDNA synthesis. The PCR primers 

S'-ATTAGATCTTACTACTCGAAGGACGCCATG-3' and 
25 S'-ATTAQATCTACAGCCACAATTCCCGCCCTA- 3^ 

were used to amplify the daol gene from tfie cDNA template by PCR. The PCR frag- 
ments were sub-cloned into the pGEM-T Easy vector (Promega) and subsequently 
ligated into the BamHl site of the CaMV 35S expression cassette of the binary vector 
30 pPCV702kana17 giving pPCV702:dao?. The vectors were subjected to restriction 
analysis and sequencing to check that they contained the con-eel constmcts. 

Example 1 a: Tranaformation of Arabidopsis thallana 

A. thaliana plants (ecotype Col-O) were grown in soil until they flowered. Agrobactarium 
35 tumefaciens (strain GV3l01:pMP110 RK) transformed with the construct of interest 
was grown In 500 mL in liquid YEB medium (5 g/L Beef extract, 1 g/L Yeast Extract 
(Duchefa), 5 g/L Peptone (Duchefa), 5 g/L sucrose (Duchefa), 0,49 g/L MgS04 
(Merck)) until the culture reached an ODeoo 0-8-1 .0. The bacterial cells were harvested 
by centrifugation (15 minutes, 5.000 rpm) and resuspended in 500 mL infiltration solu- 
40 tion (5% sucrose, 0.05% SILWET L-77 Idistributed by Lehle seeds. CatNo. VIS-02]). 

Flowering A thaliana plants were then transformed by the floral dip method (Clough Sj 
& Bent AF (1998) Plant J. 16, 735-743 (1998) with the transgenic Agrobacterium tumefa- 
ciens strain carrying the vector described above by dipping for 10-20 seconds into the 
45 Agrobacterium solution. AftenA^ards the plants were kept in the greenhouse until seeds 
could be harvested. Transgenic seeds were selected by plating surface sterilized seeds 
on growth medium A (4.4g/L MS salts [Sigma-Aldrich], 0.5g/L MES [Duchefa]: Sg/L 
Plant Agar [Duchefa]) supplemented with SO mg/L kanamycin for plants carryirig the 
nptll resistance marker, or 0.3 to 30 mM D-amino adds (as described below) for plants 
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comprising the dual-funrfion marker of the invention. Surviving plants were transferred 
to $oil and grown in tfie greenhouse. 

Lines containing a single T-DNA insertion locus were selected by statistical analysis of 
5 T-DNA segregation in the T2 population that gemiinated on kanamycin or D-amino add 
-containing medium. Plants with a single iocus of inserted T-DNA were grown and seif- 
fertilized. Homozygous T3 seed stocks were then identified by analyzing T-DNA segre- 
gation in T3 progenies and confimied to be expressing the introduced gene by northern 
blot analyses. 

10 

Example lb: Agrobacterium-fliediated fransformation of Braasiea napas 

Agrobacterium tumefaciens strain GV3101 transfonned with the plasmid of interest was 
grown in SO mL YEB medium (see Example 4a) at 28*^(5 overnight. The Agrobacterium 
solution is mixed with liquid co-cultivation medium (double concentrated MSBS salts 

IS (Duchefa), 30 g/L sucrose (Duchefa), 3.7S mgfl BAP (6-benzylamino purine, Duchefa) 
0.5 g/l MES (Duchefa), 0,5 mg/l GA3 (Gibberellic Add, Duchefa); pH5,2) until ODaoo of 
0.5 is reached. Petiols of 4 days old seedlings of Bmssica napus cv. Westar grown on 
growth medium B (MSBS salts (Duchefa), 3% sucrose (Duchefa), 0.8% oxoidagar 
(Oxoid GmbH); pH 5,8) are cut. Petiols are dipped for 2-3 seconds in the Agrobaete- 

20 rium solution and aften<vards put into solid medium for co-cultivation (co-cultivation me- 
dium supplemented with 1 .6% Oxoidagar). The co-cultivation lasts 3 days (at 24''C and 
-SO pMol/m^s light intensity). Aflenvards petiols are transferred to co-cultivation me- 
dium supplemented vwth the appropnate selection agent (18 mg/L kanamycin 

I— MMt.t|^..5iii0 Mis aiH"' iiiciiiv« naiiciiiiyuiii lor pidnis Carrying ihe nptii 

25 resistance marker, or 0.3 to 30 mM D-amino acids; as described below) for plants 
comprising the dual-function marker of the invention) and 300 mg/L Timetin (Duchefa) 

Transfomied petioles are incubated an the selection medium for four weeks at 24°C. 
This step is repeated until shoots appear- Shoots are transfen-ed to A6 medium (MS 

30 salts (Sigma Aldrich), 20 g/L sucrose, 100 mg/L myonnositol (Duchefa), 40 mg/L 
adeninesulfate (Sigma Aldrich), 500 mg/L MES, 0.0025 mg/L BAP (Sigma). S g/L oxoi- 
dagar (Oxoid GmbH), ISO mg/L timetin (Duchefa), 0.1 mg/L IBA (indol butyric acid, 
Duchefa); pH 5,8) supplemented wifri the appropriate selection agent (18 mg/L kana- 
mycin (Duchefa) for plants comprising the nptll marker kanamycin for plants canying 

35 the nptll resistance marker, or 0.3 to 30 mM D-amino acids; as described below) until 
they elongated. Elongated shoots are cultivated in A7 medium (AS medium without 
BAP) for rooting. Rooted plants are transferred to soil and grown in the greenhouse. 

Example 2: Selection analysis. 

40 T1 seeds of transgenic Arabidopsis plants were surface-sterilized and sown in Petri 
plates that were sealed with gas-penneable tape. The growth medium was half 
strength MSI 9 with 0.5% (wt/vol) sucrose and 0.B% (wt/vol) agar, plus 3 mM D- 
alanine, 3 mM D-serine or SO pg/ml kanamycin as the selective agent Plants were 
grown for 5 d after germination vwth a 16 h photoperiod at 24 ^'C. To evaluate the selec- 

45 tion efficiency on different substrates, 2,074, 1,914 and 1.810 T1 seeds were sown on 
D-alanine-, D-serine- and kanamycin-selective ptate&i respectively, and the number of 
surviving seedlings was counted (44. 32 and 43, respectively). 
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Example 3. Toxicity studies. 

To evaluate the toxic action of 3-methyl-2-oxopentanoate and 3-methyl-2- 
oxobutanoate, wild-type plants were sown on two sets of half strength MS agar plates, 
each containing one of the compounds in a range of concentrations (0.01-10 mM). 
5 Plants were slightly affected by 3-methyl-2- oxopentanoate at 0.1 mM, and total growth 
Inhibition was observed at 1 mM. For 3-methyl-2«oxobutanoate, 5 mM was required for 
complete inhibition. Further, several attempts ware made to probe the nature of D- 
serine's toxicity. In accordance with studies on E. co//, wildtype It was tried to rescue A 
thaliana grown on lethal concentrations of D-serine through amendments with five po- 
1 0 tential inhibitors of D-serine toxicity (L-serine. calcium- pantothenate, p-alanine, leucine 
and threonine) added both separately and in combinations in a very wide range of con- 
centrations (0.001-60 pg mM), without any success. 

Example 4: Enzyme assays 

15 Soluble proteins were extracted by shaking 0.1 g samples of plant rnaterial that had 
been finely pulverized in a 1.5 ml Eppendorf tube in 1 ml of 0.1 M potassium phosphate 
buffer, pH 8. DAAO activity was then assayed as follows. Reaction mixtures were pre- 
pared containing 2,120 pi of 0.1 M potassium phosphate buffer^ pH 8, 80 pi of crude 
protein extract and 100 pi of 0.3 M D-alanine. The samples were incubated for 2 h at 

20 30 ''C. The enzyme activity was then assessed, by measuring the increase in absorb- 
ance at 220 nm (E = 1 .090 M"*** cm"') associated with the conversion of D-alanine to 
pyruvate, after transferring the test tubes to boiling water for 1 0 min to stop the reac-* 
tion. In control reactions, D-alanine was added immediately before boiling. One unit of 
DAAO activity is defined as the turnover of one micromole of substrate per minute, and 

25 activity was expressed per gram plant biomass (fresh weight). The breakdown of D- 
isoleucine and 0-valine in DAAO incubations, and the associated production of 3- 
methyl-2-oxQpentanoate and 3-methyK2-oxQbutanoate, were analyzed by high- 
performance liquid chromatography. In other respects the reactions were carried out as 
described above. 

30 

Example S: Dual-Function Selection Marker 

The qualification of the DAAO enzyme as a dual-function selection marker was demon- 
strated by testing germinated T1 seeds on different selective media. The T-DNA con- 
tained both 3SS:daa1 and pNosinpf//, allowing Dsamlho acid and kanamycin selection 
35 to be compared in the same lot of seeds. 

T1 seeds were sown on medium containing kanamycin (50 pg/ml), D-alanlne (3 mM) or 
D-serine (3 mM), and the transformation frequencies found on the different selective 
media were 2.37%, 2.12% and 1.67%, respectively (Fig. 5a-c). D-alanine had no nega- 
40 tive effect on the transgenic plants, even at a concentration of 30 mM, but at this con- 
centration, D-serine induced significant growth inhibition. Fewer transgenic plants were 
found after selection on 3 mM D-serine because tlie compound slightiy inhibited the 
growth of the transgenic plants at this concentration. 

45 Further studies using lower concentrations corroborated this conclusion, and efficient 
selection using D-serine was achieved on concentrations lower than 1 mM (Fig. 4a). 
Progeny from the transgenic lines selected on D-serine and D-alantne were later con- 
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firmed to be kanamydn resistant, hence ensuring there would be no wild*type escapes 
from these lines. 

Selection of seedlings on media containing D-alanine or D-serine was very rapid com- 
5 pared to selection on kanamydn. These D-amino acids inhibited growth of wild-type 
plants Immediately after the cotyledons of wild-type plants had emerged. Therefore, 
transfonmants could be distinguished from non-transformed plants directly after gemni- 
nation. The difference between wild-type and transgenic plants after D-amino acid se- 
lection was unambiguous, with no intermediate phenotypes. In contrast, intermediate 

10 phenotypes are common when kanamydn resistance is used as a selection marker 
(Fig. Sc). Furthermore, wild-type seedlings were found to be sensitive to sprayed appli- 
cations of D-serine and D-alanine. One-week-old seedlings were effectively killed when 
sprayed on three consecutive days with either 50 mM D-serine or D-alanine, although 
the sensitivity of wild-type plants rapidly decreased with age, presumably because as 

15 the cuticle and leaves became thicker, uptake by the leaves was reduced. Transgenic 
seedlings were resistant to foliar application of D-alanine or D-serine, so selection on 
soil was possible (Fig. 5d, 11). 

Transgenic plants grown under D-alanine and D-serine selection conditions developed 
20 normally. Eariy development of transgenic plants from line 3:7, 10:7 and 13:4 was 
compared with that of wild-type plants by cultivation on vertical agar plates. No differ- 
ences in biomass, number of leaves, root length or root architecture were detected for 
the different sets of plants. Furthermore, soil-cultivated wild-type and transgenic plants 
(line 1 0:7) showed no differences in the total number of rosette leaves, number of inflo- 
25 rescences and number of siliqua after 4 weeks of growth. 

Also, the phenotypes of 17 individual T1 lines, which were picked for T-DNA segrega- 
tion, were studied and found indistinguishable from that of wild type when grown on 
soli. A problem sometimes encountered after selection on antibiotics is the growth lag 

30 displayed by transformants. This phenomenon is explained as an inhibitory effect of the 
antibiotic on the transgenic plants (Lindsay K & Gailois P (1 990) J. Exp. BoL 41 , 529- 
536). However, unlike seedlings picked from antibiotic selection plates, transgenic 
seedlings picked from D-amino acid selection plates and transferred to soil were not 
hampered in their growth and development, even temporarily. A possible reason for 

35 this difference Is that the DAAO scavenging of D-amino acids may effectively remove 
the D-amino acid in the plants. Furthermore. D-alanine and D-serine may merely pro- 
vide additional growth substrates, because their cataboUc products are carbon and 
nitrogen compounds that are central compounds in plant metabolism. Quantification of 
dao1 mRNA from six Independent D-alanine- and D-serine-resistant lines showed a 

40 range of different expression levels (Fig. 5e). These different expression levels were 
mirrored in a range of different DAAO activities (Fig. 5f). In spite of these differences in 
mRNA levels and enzyme activities, no phenotypfc variation associated with the D- 
serine and D-alanine treatment was found, suggesting that the DAAO marker is effeop- 
tive over a range of expression levels. As described above, D-isoleucine and D-valine 

45 were found to inhibit growth of the transgenic plants, but not the wild-type plants. 

Therefore, plants containing the construct were tested as described above on two sets 
of media, one containing D-isoleucine and the other containing D-valine at various con- 
centrations» to assess whether DAAO could also be used as a counter-selection 
50 marker. Unambiguous counter-selection selection was achieved when seeds were 
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sown on either D-tsoleudne or D-valine at concentrations greater than 1 0 mM (Fig. 4 
c,d). 

Thirteen individual lines expressing DAAO were tested for their response to l> 
5 isoleucine and all of them were effectively killed, whereas wild-type plants grew well, 
with no sign of toxicity. Similar results were obtained for D-valine, although this com- 
pound was found to have a moderately negative effect on VMldHiype plants at higher 
concentrations (Fig. 4 d). The keto acid produced in DAAO catabolism of [>iso!eucine 
IS the same as that formed when L-isoleudne is metabolized by the endogenous 
10 branched-chain amino acid transaminase [EC: 2.6.1.42], namely 3-methyl-2- 
oxopentanoate (Kyoto Encyclopedia of Genes and Genomes, metabolic pathway web- 
site. http://www.genome.ad.jp/ kegg/metabolism.html). • 

Presumably endogenous transaminase may be specific for the L-enantiomer, so the 
15 corresponding D-enantiomer is not metabolized in wildtype plants, but only in plants 
expressing DAAO. The negative effects of L-isoIeudne (but not of the D-form) ob- 
served on wildtype plants, supports this speculation, incubation of cell-free extracts 
from tfaof transgenic line 10:7 with D-isoleucine and D-valine resulted in 15~fold and 7- 
fold increases in production of 3-methyl-2-oxopentanoate and 3-methyl-2- 
20 oxobutanoate, respectively, compared to extracts of wild-type plants. Further, 3-methyi- 
2-oxopentanoate and 3- methyl-2-oxobutanoate impaired growth of A. thaliana, cor- 
roborating the suggestion that these compounds, or products of their metabolism, are 
responsible for the negative effects of D-isoieucine and D-valine on the transgenic 
plants. 

25 

The toxicity of some D-amino acids on organisms is not well understood, and has only 
occasionally been studied in plants (Gamburg KZ & Rekoslavskaya Nl (1991) 
Fiziologiya Rastenii 38, 1236-1246). Apart from A thaliana, we have also tested the 
susceptibility of other plant species to D-serine, including poplar, tobacco, barley, 

30 maize, tomato and spruce. We found all tested species susceptible to I>serine at con- 
centrations similar to those shown to be toxic for A. Utaliana. A proposed mechanism 
for D-serine toxicity in bacteria is competitive inhibition of a-alanine coupling to pantoic 
acid, thus inhibiting formation of pantothenic acid (Cosloy SD & McFall E (1 973) J. Bao 
terioL 1 14, 685-694). It is possible to alleviate 0-serine toxicity in D-serine- sensitive 

35 strains of Escherichia coli by providing pantothenic acid or ^-alanine in the medium, but 
D-serine toxicity in A. thatiana could not be mitigated using these compounds. A sec- 
ond putative cause of D-amino acid toxicity is through competitive binding to tRNA. 
Knockout studies of the gene encoding D-Tyr-tRNATyr deacylase in £. coli have shown 
that the toxicity of D-tyrosine increases in the absence of deacylase activity (Soutourina 

40 J et aL (199B) J. Biol. Chem. 274, 19109-^19114), indicating that D-amIno acids inter- 
fere at the tRNA level. Genes similar to that encoding bacterial deacylase have also 
been identified in A. thaliana (Soutourina J et al. (1996) J. Biol. Chem. 274, 19109- 
19114), corroborating the possibility that the mode of toxic action of D-amino acids 
might be through competitive binding to tRNA. 

45 

Example 6: Constructs useful for self-excising expression cassettes using i-Scel 

Two expression constructs are constructed for carrying out the present invention (SEQ 
ID NO: 15, 16). The backbone of both plasmid constructs (pSUN derivative) contains 
origins for the propagation in £ coli as well as in Agrobacterium and an aadA expres- 
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slon cassette (conferring spectinomycin and streptomycin resistance) to refect for 
transgenic bad:eria cells. The sequences for constructing the DMA constructs are am-- 
plified Incorporating the appropriate restriction sites for subsequent cloning by PGR. 
Cloning was done by standard methods as described above. The sequence of the con- 
5 structs is verified by DMA sequence analysis. 

* 

Example 6a: DAAO driven by constitutive Nitriiase Promoter 

The first ONA construct (SEQ ID NO: 16} comprises an expression cassette for the 
amino add oxidase (DAAO) firom Rhodotorula gracilis under control of the Arabidopsis 
10 thaliana Nitriiase promoter. The DAAO cassette is flanked by a direct repeat of the 35S 
terminator functioning both as transcription terminator of the DAAO expression cas- 
sette and as homology sequences. 

Further comprised is a expression cassette for the ^-glucuronidase which may function 
15 as a substitute for an agronomically valuable trart under control of the Arabidopsis 
sTPT promoter (i.e. TPT promoter truncated version, WO 03/006660; SEQ ID NO: 27 
cited therein), and the CaMV 3SS terminator. 

Example 6b: Self-«xclsable DAAO cassette 

20 The second DNA (SEQ ID NO; 15) comprises an expression cassette for the O-amino 
acid oxidase (DAAO) from Rhodotorula gracilis under control of the Arabidopsis 
thaliana Nitriiase promoter. The DNA construct further comprises a Tet on expression 

tfj^&t^iiio ■ti*^ ^iaii#W«a iw< aiitauuSwi i caaiur i wi UlS IIIJIIIIII^ CI lUUI luClCSaoiC wrilUII 

is placed under control of a Tet-regulatabie promoter The system further requires ex- 
25 pression of the Tet-repressor, which is realized under control of the constitutive ptXA 
promoter from Pisum sativa. 

Both the sequences encoding the DAAO cassette, the l-Sce-l expression cassette, and 
the rtTA expression cassette (for the reverse tetracycline responsive repressor) are 
30 flanked by a direct repeat of the 35S terminator functioning both as transcription termi- 
nator of the l-Sce-l expression cassette and as homology sequences. 

Example 7: Use of the constructs for the method of the invention 

Example 7.1: Co-transformation 

35 Arabidopsis thaliana plants are transformed as described above with a mixture of DNA 
construct i (binary vector SEQ ID NO: 16) and a second binary vector comprising a 
C3FP (green fluorescence protein) expression cassette. In a first selection process 
transgenic plants are selected comprising both constructs by employing D-alanine me- 
diated selection. 3 mM and 30 mM D-alanine are used. 

40 

D>alanine resistant plants comprising the first DNA construct (detectable by GUS stain- 
ing) also comprising the gfp gene (as assessed by green fluorescence) are isolated 
and crossed with wild-type plants. Resulting seeds are used for a second counter- 
selection process, wherein said seeds are germinated on D-isoleucine comprising me- 
45 dium (comprising either 3 mM or 30 mM D-isoleucine). D-isoleudne resistant plants — 

comprising the gfp gene - can be easily selected. i 
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Example 7.2: Marker excision 

Arabidopsis thaliana plants are transformed as described above with a mixture of DMA 
construct 1 (binary vector SEQ ID NO: 16). In a first selection process transgenic plante 
are selected comprising constmct I by employing D-alanine mediated selection. 3 mM 
5 and 30 mM D-alanine are used. 

D-alanine resistant plants comprising the first DNA construct (detectable by GUS stain- 
ing) are isolated and crossed with a transgenic master plant comprising a transgenic 
expression cassette for the l-Sce-l homing endonudease under control of a constitutive 
10 promoter (as described in WO 03/004659). Resulting seeds are used for a second 
counter-selection process, wherein said seeds are germinated on D-isoleucine com- 
prising medium (comprising either 3 mM or 30 mM D-isoleucine). D-isoleudne resistant 
plants still comprising the G US-expression cassette can be easily selected. 

1 5 Example 7.3: Use of a self-excisable marker cassette 

Arabidopsis thaliana plants are transformed as described above with a mixture of DNA 
construct II (binary vector SEQ ID NO: IS). In a first selection process transgenic plants 
are selected comprising construct II by employing D-alanine mediated selection. 3 mM 
and 30 mM D-alanine are used. 

20 

D-alanine resistant plant cells comprising the DNA construct 11 are isolated and further 
cultivated on medium lacking D-alanine. Doxycycline (Sigma; 1 to 5 pg/ml) is added for 
induction of the marker excision process and cells are incubated for 24 to 48 h on said 
induction medium. Subsequently cells are further incubated for 3 to S days on medium 

25 lacking the inducer and D-amino acids (to allow for reduction of DAAO protein levels 
from prior expression). The resulting cells are used for a second counter-selection 
process, wherein said cells are further selected on D-isoleucine comprising medium 
(comprising either 3 mM or 30 mM D-isoleucine). Selected D-isoleucine resistant cells 
are regenerated Into fertile plants and assessed for their transgenic status. By PGR 

30 mediated analysis it can be demonstrated that the region flanked by the 3SS terminator 
sequences was accurately excised from the plant genome deleting both the l-Scel ex- 
pression cassett, the DAAO expression cassette, and the rtTA expression cassette (for 
the reverse tetracycline responsive repressor) 
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1. A method for producing a transonic plant comprising: 

0 transforming a plant call with a first expression cassette comprising a nucleic acid 
sequence encoding a D-ammo add oxidase operably linked with a promoter al- 
lowing expression in plant cells or plants, in combination with at least one second 

^elS'^an'd '^"^^"^^ *** P'^"* ^" agronomically valu- 

ii) providing at least one first compound X. which is phytotoxic against plant cells 
not functionally expressing said D-amino acid oxidase, wherein said compound X 
can be metabolized by said D-amino acid oxidase into one or more compound{sJ 
Y which are non-phylotoxic or less phytotoxic than compound X. and 

ill) treating said transfomied plant cells of step i) with said first compound X in a 

SrT**."*^?" ^""^ P'^"* ""^^^ comprising in their genome 

bom sard first and said second expression cassette, wherein said first expression 

20 ^.nH ^ K re^stance to said transfomied plant cells against Mid com- 

20 pound X by expression of said D-amino add oxidase, and 

iv) providing at least one second compound M. which is non-phytotoxic or moder- 
ately phytotoxic against plant ceils not ftjnctionally expressing said D-amino add 
oxidase, wherein said compound M can be metabolized by said D-amlno acid 
oxidase into one or more compound{s) N which are phytotoxic or more phytotoxic 
than compound M, and lyiuiuAie 

said first expression cassette and said second 
expression cassette and treating resulting said plant cells with said second com- 
pound M in a concentration toxic to plant cells still comprising said first expres- 
sion cassette, and selecting plant cells comprising said second expression cas- 
sette but ladong said first expression cassette. 

2. The method of Claim 1. wherein said first expression cassette for said D-amino acid 
oxidase and said second expression cassette for said agronomically valuable trait 



25 



30 



35 



40 



a) both comprised in one DNA construct and combination is broken by deletion or 
excision of said first expression cassette for said D-amino acid oxidase, or 

b) are comprised on separate DMA constructs which are transformed in combination 
by co-transformation into said plant cells, and combination is broken by subse- 
quent segregation of the two expression cassettes. 



45 
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3. The method of Claim 1 and 2, wherein said method for producing a transgenic plant 
comprises the steps of: 

i) transforming a plant cell with a first DMA construct comprising 

a) a first expression cassette comprising a nucleic acid sequence encoding a D- 
amino acid oxidase operably linked with a promoter allowing expression In 
plant cells or plants, wherein said first expression cassette is flanked by se- 
quences which allow for specific deletion of said first expression cassette, and 

b) at least one second expression cassette suitable for confemng to said plant an 
agronomically valuable trait wherein said second expression cassette Is not 
localized between said sequences which allow for specific deletion of said first 
expression cassette, and 

ii) providing at least one first compound X, which is phytoto3dc against plant cells 
not functionally expressing said D-amino acid oxidase, wherein said compound X 
can be metaboli2ed by said D-amino add oxidase into one or more compound(s) 
Y which are non-phytotoxic or less phytotoxic than compound X, and 

iii) treating said transformed plant cells of step i) with said first compound X in a phy- 
totoxic concentration and selecting plant cells comprising in their genome said 
first DNA construct, conferring resistance to said transformed plant cells against 
said compound X by expression of said D-amino acid oxidase, and 



iv) providing at least one second compound M, which is non-phytotoxic or moder- 
ately phytotoxic against plant cells not functionally expressing said O-amrno acid 
oxidase, wherein said compound M can be metabolized by said D-amrno acid 
oxidase into one or more compound(s) N which are phytotoxic or more phytotoxic 
30 than compound M, and 

V) inducing deletion of said first expression cassette from the genome of said trans- 
formed plant cells and treating said plant cells with said second compound M in a 
concentration toxic to plant cells still comprising said first expression cassette, 
35 thereby selecting plant cells comprising said second expression cassette but 

lacking said first expression cassette. 



4. The method of any of Claim 1 to 3 further comprising the step of regeneration of a 
fertile plant. 

5. The method of any of Claim 1 to 4, wherein said first compound X is comprising a D- 
amino acid structure selected from the group consisting of D-tryptophane, D- 
histidine, D-srginlne, ID-threonine. D-methionine, Oserine, and D-alanine, and de- 
rivatives thereof. 
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J^^method of any of Claim 1 to 5, wherein said second compound M is comprisinq 
a p-amino aad structure selected from the group consisting of D-jsoleucine 
valine, D-asparagine. D-ieucine, D-lysine. D-pniline, and D^iutamine and deriv» 
fives thereof. ' ucuva- 

7. The method of any of Claim 1 to 6. wherein deletion of said first expression cassette 
for the D-amino aad oxidase Is realised by a method selected fixim: 

a) recombination induced by a sequence specinc recombinase. wherein said first 
expression cassette Is flanked by corresponding recombination sites in a way 
Oiat recombination between said flanking recombination sites results in deletion 
of the sequences in-between from the genome, or 

b) homologous recombination between homology sequences A and A' flanking said 
first expression cassette, preferably induced by a sequence-specific double- 
strand break caused by a sequence specific endonudease, wherein said homol- 
ogy sequences A and A* have sufficient length and homology In order to ensure 
homologous recombination between A and A', and having an orientation which - 
upon recombination between A and A' - will lead to excision of said first expres- 
sion cassette from the genome of said plant 

8. The method of Claim 7. wherein the recombinase or sequence-specific endonude- 
ase. respectively, is expressed or combined with its corresnondlng reGomhin^ti^o 
recognition site, respecOvely, by a method selected from the group consisting oft 

a) incorporation of a second expression cassette for expression of the recombinase 
or sequence-specific endonudease operably linked to a plant promoter into said 
DISIA construct, preferably together with said first expresskm cassette flanked by 
said sequences which allow for specific deletion, or 

b) incorporation of a second expression cassette for expression of the recombinase 
or sequence-specific endonudease operably linked to a plant promoter into the 
plant cells or plants used as target material for the transformation thereby aener- 
ating master cell lines or cells, or 

c) incorporation of a second expression cassette for expression of the recombinase 
or sequence-specific endonudease operably linked to a plant promoter into a 
separate DNA construct, which is transformed by way of co-transfonnation with 
said first DNA construct into said plant cells, or 

d) incorporation of a second expression cassette for expression of the recombinase 
or sequence-specific endonudease operably linked to a plant promoter into the 
plant cells or plants which are subsequently crossed with plants comprising the 
DMA construct of the invention. a 
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a The method of Claim 7 or 8, wherein deletion of said first expression cassette for the 
D^arnino acid oxidase is induced or activated by indudng expression and/or activity 
of sard sequence-specific recombinase or endonuclease by a method selected from 
the group consisting of 

a) inducible expression by operably linking the sequence encoding said recombi- 
nase or endonuclease to an inducible promoter and 

b) inducible activation, by employing a modified recombinase or endonuclease 
comprising a llgand-binding-domain, wherein activity of said modified recombi- 
nase or endonuclease can by modified by treatment of a compound having bind- 
ing activity to said ligand-binding-domain. 

10. The method of any of Claim 1 to 9, wherein the DMA construct Is defined as In any 
of Claim 11 to 20 and the resulting plant cell or plant Is selection marker frea. 

1 1- A DNA constructs suitable for the method of any of Claim 1 to 10, comprising 

a) a first expression cassette comprising a nucleic acid sequence encoding a D- 
amino acid oxidase operably linked with a promoter allowing expression In 
plant cells or plants, wherein said first expression cassette is flanked by se- 
quences which allow for specific deletion of said first expression cassette, and 

b) at least one second expression cassette suitable for conferring to said plant an 
agronomically valuable trait, wherein said second expression cassette is not 
localized between said sequences which allow for specific deletion of said first 
expression cassette. 

12. The DNA construct of Claim 11, wherein said D-amino add oxidase expressed 
from said first expression cassette has preferably metabolizing activity against at 
least one D*amino acid and comprises the following consensus sequence: 

[LIVM>[LIVM]-H"HJslHA]-Y-G-x-IGSAHGSA3-x-G-Xs- 

wherein amino acid residues given in brackets represent alternative residues for 
the respective position, x represents any amino acid residue, and indices numbers 
indicate the respective number of consecutive amino acid residues. 

13. The DNA construct of Claim 11 or 12, wherein said D-amino add oxidase has en- 
zymatic activity against at least one of the amino acids selected from the group 
consisting of D-alanine« D-serine, D-isoleucine, D-valine, and derivatives thereof. 

14. The DNA construct of any of Oaim 1 1 to 13 wherein said D-amino acid oxidase is 
described by a sequence of the group consisting of sequences described by Gen- 
Bank or SwisProt Acc.No. JX0152, O01739, 033145, 035078, O4S307, P00371, 
P14920, P18894, P22942. P24S52, P31228. P80324. Q19564, Q2S382, Q7PWX4! 
Q7PWYa. Q7Q7G4, Q7SFW4, Q7Z312, Q82MI8. Q86JV2. QaN552, Q8P4M9. 
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Q8PG9S. Q8R2R2, Q8S2NS, Q8VCW7. Q921M5. Q92220, Q95XG9 Q99045 
Q99489. Q9C1L2. Q9JXF8, Q9V5P1. QSVMSO. Q9X7P6. Q9Y7N4 'q921Ms' 
Q9Z302, and U60066. vio^iiwia. 

5 15. The DMA construct of any of Claim 1 1 to 14 wherein said Domino acid oxidase is 
selected from the group of amino acid sequences consisting of 

a) the sequences described by SEQ ID NO: 2. 4, 6, 8. 10. 12. and 14. and 

b) the sequences having a sequence homology of at least 40% with a sequence 
as described by SEQ ID NO: 2. 4, 6. 8, 1 0. 12, and 14, and 

c) the sequences hybridizing under low or high stringency conditions with a se- 
quence as described by SEQ ID NO: 2. 4. 6. 8, 10. 12, and 14. 

16. The ONA construct of any of Claim 11 to 15. wherein said sequences which allow 
for specific deletion of said first expression cassette are selected from the arouo of 
sequences consisting of 

a) recombination sites for a sequences-specific recombinase arranged in a way 
that recombination between said flanidng recombination sites results in deletion 
of the sequences in-between from the genome, and 

b) homology sequences A and A' having a sufficient length and homolon 

25 to ensure homologous recombination between A and A', and havirig an orienta- 

tion which - upon reconnbination between A and A' - will result in deletion of the 
sequences in-between from ttie genome. 

17. The DNA constnict of Claim ie, wherein said recombination sites correspond to a 
30 recombinase selected from the group consisting of the ere recombinase tiie FLP 

recombinase, the Gin recombinase. the Rn recombinase, and the R recornblnase. 

18. The DNA construct of Claim 16. wherein said DNA construct comprises a recogni- 
tion site of at least 10 base pairs for a sequence specific endonuclease between 

35 said homology sequences A and A'. 

19. The DNA construct of Claim 18. wherein said recognition site corresponds to a 
sequence-specific endonuclease selected from the group consisting of the homing 
endonucteases l-Scel. I-Cpal, l-Cpall, l-Crel. and l-Chul and chimeras thereof with 

40 llgand-bmdlng domains. 



20 



20. The DMA construct of any of Claim 16 to 19, wherein said DNA construct further 
comprises a expression cassette for the sequence specific endonuclease or re- 
combinase suitable for mediating deletion of the first expression cassette for the D- 
45 amino acid oxidase. 
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21. The DIMA construct of Claim 20, wherein expression and/or activity of sard se- 
quence-spedfic recombinase or endonuclease can be induced and/or activated by 
a method selected from the group consisting of 

5 a) indudble expression by operably linking the sequence encoding said recombi- 

nase or endonuclease to en inducible promoter, 

b) inducible activation, by employing a modified recombinase or endonuclease 
comprising a ligand-bfnding-domain, wherein activity of said modified recombi- 
10 nase or endonudaase can by modified by treatment of a compound having 

binding activity to said ligand-binding-domain. 

22. A transgenic vector comprising a DNA construct of any of Claim 11 to 21. 

15 23. A transgenic cell comprising a DNA construct of any of Claim 11 to 21 or a vector 
of Claim 22. 

24. A transgenic cell of Claim 23, wherein said cell is a plant cell. 

20 25, A transgeniCi non-human organism comprising a DNA construct of any of Claim 1 1 
to 21, a vector of Claim 22. or a transgenic cell of Claim 23 or 24. 

26. A transgenic, non-human organism of Claim 25 wherein said organism is a plant. 



10 



820" d 280: ■ Ju* m 60:Z.l ^ppS/SO/a U |B2" :^diiJ3 

BASF Plant Science GmbH go 

IMPROVED CONSTRUCTS FOR MARKER EXCISION BASED ON DUAL 
FUNCTION SELECTION MARKER 

SUMMARY 

The invention relates to improved construct and methods for eliminating maker se- 
quences from the genome of plants, based on dual-function selection marker which - 
depending on the employed compound - can act as both negative and counter- 
selection marker. 
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<210> 
<211> 
<212> 
<213> 

<220> 

<222> 
<223> 



1 

1160 
DNA 

Rhodosporidium toruloides 



CDS 

(1) . . (1104) 
coding for DAAO 



<400> 1 

atg cac teg cag aag cgc gtc gtt gtc etc gga tea ggc gtt ate ggt 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 

15 10 15 

ctg age age gee etc ate etc get egg aag gge tae age gtg cat att 

Leu Ser Ser Ala Leu lie Leu Ala Arg Lys Gly Tyr Ser Val His lie 

20 25 30 



4B 



36 



35 etc gcg cgc gae ttg ccg gag gac gtc teg age cag act ttc get tea 
Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
35 40 45 



144 



eca tgg get ggc gcg aat tgg acg ect ttc atg acg ctt aca gac ggt 
40 pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 
50 55 60 



192 



45 



cct cga caa gca aaa tgg gaa gaa teg act ttc aag aag tgg gtc gag 
Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu 
65 70 75 80 



240 
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ttg gtc ccg acg ggc cat gcc atg tgg etc aag ggg acg agg egg ttc 28 B 

Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Giy Thr Arg Arg Phe 

85 do 95 

5 

gcg cag aac gaa gac ggc ttg etc ggg cac tgg bac aag gac ate acg 336 
Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp lie Thx 

100 105 110 

10 cca aat tac cgc ccc etc cca tct xlcc gaa tgt cca cot ggc get ate 384 

Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala lie 
115 120 125 

ggc gta ace rac gac ace etc tec gtc cac gca cca aag tac tgc cag 432 

15 Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cya Gin 

130 135 140 

tac ctt gca aga gag etg cag aag etc ggc gcg acg ttt gag aga egg 4 SO 

Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 
20 145 150 155 160 



qCw g^i. aC^ w u b i^A^ ws»^ ^ok^ ^ tf^g t^Ak. uk.^ ^d^^ vi^^ 

Thr Val Thr Ser Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 

165 170 175 

aac get acg gga ctt ggc gcc aag teg att gcg ggc ate gac gac caa 57 6 

■ 

Asn Ala Thr Gly Leu Gly Ala Lys Ser He Ala Gly He Asp Asp Gin 

160 1B5 190 

30 gcc gcc gag cca ate cgc ggg caa acc gtc etc gtc aag tec cca tgc 624 

Ala Ala Glu Pro He Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
195 200 205 



aag cga tgc acg atg gac teg tec gac ccc get tct ccc gcc tac ate 672 

35 Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr He 

210 215 220 

att ccc cga cca ggt ggc gaa gtc ate tgc ggc ggg acg tac ggc gtg 720 

lie Pro Arg Pro Gly Gly Glu Val He Cys Gly Gly Thr Tyr Gly Val 
40 225 230 235 240 

gga gac tgg gac ttg tct gtc aac cca gag acg gtc cag egg ate etc 7 68 

Gly Asp Trp Asp Leu Ser Val Asn Pro Glu Thr Val Gin Arg He Leu 

245 250 255 

45 
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5 



10 



3 

aag eac tgc ttg cge etc gac cca acc at-.- t-r^rr =./,^ 

_ 3 ya<- fc-tg dcc ate teg age gac gga aeg ate 

Lys fli3 Cys Leu Arg ieu Asp Pro Thr He Ser Ser Asp Gly Thr Ila 

260 26S 270 

gaa ggc ate gag gte cte ege cae aac gtc ggc ttg cga cct gca cga 
. eiu Gly He Glu Val Leu Arg His Asn Val Sly Leu Arg Pro Ala Arg 

27S 280 

•=ga ggc gga ccc cge gtt gag gca gaa egg ate gte ctg cct etc gac 
Arg Gly Gly Pro Arg Val Glu Ala Glu Arg He Val Leu Pro Leu Asp 

293 

egg aea aag teg cce etc teg etc ggc agg ggc age gca oga gcg gcg 
Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 

315 • 

aag gag aag gag gtc acg ctt gtg eat gcg tat ggc ttc teg agt gcg 
r.ya Glu Lys Glu val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

^"^■^ 330 "a-ac 

t J jj «^ * 

gga tac cag cag agt tgg ggc gcg gcg gag gat gtc gcg cag etc gtc 
Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin L«u Val 

340 345 350 

25 gae gag gcg tte cag egg tac cae ggc gcg gcg egg gag teg aag ttg 
Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 

360 



30 



tagggcggga tttgtggctg tattgcgggc atetacaaga aaaaaaaaaa 



<210> 2 

<211> 3^8 

<212> PRT 

35 <213> Rhodosporidium toruloides 

<400> 2 



40 



Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val He Gly 
^ ^ 10 15 

l.eu Ser Ser Ala Leu He Leu Ala Arg Lys Gly Tyr Ser Val His He 

20 25 30 

Leu Ala Arg Asp leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
35 40 45 



816 



864 



912 



960 



1008 



1056 



1104 



Bsiaaaa 1160 
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Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 

55 SO 



50 



5 Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu 

75 80 



65 



70 



10 



1S 



Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Arg Phe 

85 90 ®^ 

Ale Gin Aan Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp He Thr 

100 105 

Pro Asn Tyr Arg Pro Lea Pro Sar Ser Glu Cys Pro Pro Gly Ala He 
115 120 125 

* 

Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lya Tyr Cys Gin 
130 ■ 135 140 



20 Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 

-icft 155 160 

145 ISO 



Thr 



Val Thr Ser Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 



165 



170 



175 



25 



30 



Asn Ala Thr Gly Leu Gly Ala Lys Ser He Ala Gly He Asp Asp Gin 

180 IBS 190 

Ala Ala Glu Pro He Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
19S 200 205 

Lys Arg Cya Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr He 
210 215 220 



35 He Pro Arg Pro Gly Sly Glu Val He Cys Gly Gly Thr Tyr Gly Val 
225 230 23S 240 



40 



45 



Gly Asp Trp J^p Leu Ser Val Asn Pro Glu Thr Val Gin Arg He Leu 

245 .250 255 

Lys His cys Leu Axg Lau Asp Pro Thr He Ser Ser Asp Gly Thr He 

260 265 270 

Glu Gly He Glu Val Leu Arg His Asn Val Gly Leu Arg Pro Ala Arg 
275 280 28S 
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Arg Gly Gly Pro Arg Val Glu Ala Glu Arg lie Val Leu Pro Leu Asp 
290 300 

5 Ara Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 
305 310 315 320 

Lys Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

325 330 335 



Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 

340 345 350 



Asp Glu Ala Phe 61n Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 



15 • 





35S 360 




<:2io> 


3 




<211> 


1005 




<212> 


DUA 




<213> 


Caenorhabditlfl elegans 




<220> 






<:221> 


CDS 




<222> 


(1) (1002) 




<223> 


coding for DAAO 




<400> 


3 




atg gca aac ata att ceg aag att 


gca 


Met Ala Asn He He Pro Lys lie 


Ala 


1 


S 





365 



10 15 



gga tgt act tea gca ctt caa ata tea aaa get ata cca aat gcg aaa 
Gly Cys Thr Ser Ala Leu Gin He Ser Lys Ala Ila Pro Asn Ala Lys 

20 25 30 

ata act gtg etc cac gat aaa cca rtt aaa aaa teg tgc agt gca gga 
He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 
35 40 4S 

40 cca gca gga tta ttt aga ate gat tat gag gag aat act gaa tac gga 
Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asti Thr Glu Tyr Gly 
50 SS 60 

cgt get tct crc gcc tgg ttc tea cat. etc tac cgc act aca aaa gga 
46 Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 



48 



96 



144 



192 



240 
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65 70 75 80 



tec gaa acc ggc gtg aaa tta gtt tec gga cat att caa tec gac aac 
Ser Giu Thr Gly Val Lys ieu Val Ser Gly His lie Gin Ser Asp Asn 

65 do gs 



2sa 



10 



ttg gag tea ttg aag caa caa caa aga gcc tat gge gat att gtg tac 
Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyir Gly Asp He Val Tyr 

100 . 105 110 

aac ttt aga ttc ttg gat gat aga gaa egg ctg gac att ttt ccc gaa 
Asn £»he Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He E>he Pro Glu 
115 120 125 



336 



384 



IS cca tea aag cac tgc att cac tac acc gcc tac gca tea gaa ggt aac 
Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 



432 



20 



aag tac gtg cct tat ttg aag aat ttg ctg ctt gag caa aaa ate gag 
Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys He *Glu 
145 150 155 160 



480 



25 



ttc aag caa caa gaa gtg acg agt ttg gae gca gtc gcc gac get ggt 
Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 

165 170 175 



528 



30 



tac gat gtt att gta aac tgc gca ggc tfcg tac ggt gga aag ttg get 
Tyr Asp Val He Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

180 185 130 

ggt gat gac gat act tgc tac ccc att aga gga gtc att ttg gaa gtt 
Gly Asp Asp Asp Thr Cys Tyr Pro He Arg Gly Val He Leu Glu Val 
195 200 205 



576 



624 



35 gat gca cca tgg cac aag cac ttc aat tat cga gae ttt act act ttc 
Asp Ala Pro Trp His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 

210 215 220 



672 



aca att cca aaa gag cac age gtg gtg gtt ggg tec acc aag cag gac 
40 Thr He Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
22S 230 235 240 



720 



45 



aat cga tgg gat ttg gag ate acc gac gag gat aga aat gat att ttg 
Asn Arg Trp Asp Leu Glu He Thr Asp Glu Asp Arg Asn Asp He Leu 

245 250 255 



768 
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20040207 



PF SS443 EP 



aaa ega t:ac att got tta cat act gga atg aga gag cca aag att ate 
hys Arg Tyr lie Ala Leu His Pro Gly Met Arg GIu Pro Lys lie lie 

260 265 270 

aaa gaa tgg tea gca etc cgc ccg gga cgt aag cat gtc aga att gaa 
Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg lie Glu 
275 260 285 



616 



864 



10 gcg cag aag agg aca tct gtt gga aac tea aaa gat tat atg gtt gtg 
Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
290 295 300 



912 



cat cac tat ggt cac ggg age aac gga ttc acg ttg ggt tgg gga aca 
15 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 



B6Q 



20 



gca att gaa gca act aaa ctt gtt aag act gca eta gga tta taa 
Ala lie Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 

325 ' 330 



1005 



<21Q> 4 

<211> 334 

<212> PRT 

25 <213> Caenorhabditis elegans 



30 



<:4Q0> 4 

Met Ala Asrx lie lie Pro Lys lie Ala lie lie Gly Glu Gly Val lie 
15 10 15 

Gly Cys Thr Ser Ala Leu Gin lie Ser Lys Ala lie Pro Asn Ala Lys 

20 25 30 



He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 
35 35 40 45 

Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 . 60 

40 Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
65 70 75 eo 

Ser Glu Thr Gly Val Lys Leu Val Ser Gly His He Gin Sex Asp Asn 

B5 90 

45 
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Lea'GZu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp lie Val Tyr 

100 105 110 

Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp lie Phe Pro Slu 
S 115 120 125 

Pro Ser Lys His Cys lie His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 

10 Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys lie Glu 
145 150 155 160 

Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 

165 170 175 

15 

Tyr Asp Val lie Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

160 IBS 190 

Gly Asp Asp Asp Thr Cys Tyr Pro lie Arg Gly Val lie Leu Glu Val 
20 195 200 205 

Asp Ala Pro Trp • His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
210 215 220 

25 Thr -lie Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

r 

Asn Arg Trp Asp Leu Glu lie Thr Asp Glu Asp Arg Asn Asp lie Leu 

245 250 255 

Lys Arg Tyr lie Ala Leu His Pro Gly Met Arg Glu Pro Lys lie He 

260 265 270 

Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg lie Glu 
35 275 280 285 

Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
290 295 300 

40 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

Ala He Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 

325 330 

45 



« 
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<210> S 

■ 

<211> 1186 
<212> DNA 

<213> Nectrla haematococca 

<220> 

<221> CDS 

<:222> (42) (1124) 

<:223> coding for DAAD 

<400> 5 

agcgacttga atttagcgaa aagaacttgt caaccacaat c atg tec aac aca ate 56 

Met Ser Asn Thr lie 
1 5 

gtc gtc gtt ggt gee ggt gtc att ggc ttg acg teg gcc ttg ttg etc 104 
Val Val Val Gly Ala Gly Val lie Gly Leu Thr Ser Ala Leu Leu Leu 

10 15 20 

20 tec aag aac aag ggc aac aag ate acc gtc gtg gcc aag cac atg ccc 132 
Ser Lys A.sn Lys Gly Aan Lys lie Thx Val Val Ala Lys His Met Pro 

25 30 3S 

ggc gac tat gac gtt gaa tac gcc teg cct ttt get ggt gcc aac cac 200 
25 Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe Ala Gly Ala Asn His 

40 45 50 

tec ccc atg gcg acg gaa gag age age gaa tgg gaa cgt cgc act tgg 248 
Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp Glu Arg Arg Thr Trp 
30 55 60 65 



15 



as 



tac gag ttt aag aga ctg gtc gag gag gtc cct gag gcc ggt gtt cat 
Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro Glu Ala Gly Val His 
70 75 80 85 

ttc cag aag tct cgc ate cag agg cgc aat gtg gac act gaa aag gcg 
Phe Gin Lys Ser Arg lie Gin Arg Arg Asn Val Asp Thr Glu Lys Ala 

90 95 100 

40 cag agg tct ggt ttc cca gac gcc etc ttc teg aaa gaa ccc tgg ttc 
Gin Arg Ser Sly Phe Pro Asp Ala Leu Phe Ser Lys Glu Pro Trp Phe 

105 110 

aag aac atg ttt gag gac ttc cgt gag cag cac cct age gag gtc ate 
45 Lya Asn Met Phe Glu Asp Phe Arg Glu Gin His Pro Ser Glu Val lie 



296 



344 



392 



440 
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20040207 



10 



125 



PF 5S443 EP 



130 



ccc ggt tac gac tct ggc tgc ggg ttc aca teg gtg cgc ate aac acg 
Pro Gly Tyr Asp Ser Giy Cys Glu Phe Thr Ser Val Cys He Asn Thr 
135 140 145 



48B 



10 



gcc ate tac etc ccc tgg etc etc ggc cag tgc ate aag aat ggc gtc 
Ala He Tyr Leu Pro Trp Leu Leu Gly 61n Cys He Lys Asn Gly Val 
ISO 155 160 165 

ate gtc aag cgc gcc ate etc aac gac att age gag gcc aag aag ctg 
He Val Lys Arg Ala He Leu Asn Asp He Ser Glu Ala Lys Lys Leu 

l^^O 173 . leo 



536 



5B4 



1S 



age cac gcg ggc aag acg ccc aan ate ate gtc aac gcc acg ggt etc 
Sear His Ala Gly Lys Thr Pro Asn He He Val Asn Ala Thr Gly Leu 

185 190 195 



632 



ggc tec tac aag ctg ggc ggt gtc gag gac aag ace atg gcg cct: gcg 
20 Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys Thr Met Ala Pro Ala 

200 205 210 



680 



25 



egg gga cag att gtg gtt gtg cgc aac gag age age ccc atg etc etc 
Arg Gly Gin He Val Val Val Arg Asn Glu Ser Ser Pro Met Leu Leu 
215 220 225 



728 



30 



act tea ggt gtc gag gac ggc ggt get gat gtc atg tac ttg atg cag 
Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val Met Tyr Leu Met Gin 
230 235 240 245 

cga gca get ggc ggt ggc acc ate ctg ggc ggt ace tac gac gtt ggc 
Arg Ala Ala Gly Gly Gly Thr He Leu Giy Gly Thr Tyr Asp Val Gly 

250 255 260 



776 



824 



35 aac tgg gag tct cag cca gac ccc aac ate gcg aat cgc ate atg cag 
Asn Trp Glu Ser Gin Pro Asp Pro Asn He Ala Asn Arg He Met Gin 

265 270 275 



40 



cgc ate gtc gag gtg egg ccc gag att gcc aac ggc aag ggc gtc aag 
Arg He Val Glu Val Arg Pro Glu He Ala Asn Gly Lys Gly Val Lys 
2eO 2B5 290 



ggg ctg age gtg ate cga cac gcc gtc ggc atg egg ccg tgg cga aag 
Gly Leu Ser Val He Arg His Ala Val Gly Met Arg Pro Trp Arg Lys 
45 29S 300 305 



872 



920 



968 
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PF 5S443 EP 



gac gga gtc agg ate gag gag gag aag ctg ga^: gat gag act tgg ate 
Asp Gly Val Arg lie Glu Glu Glu Lys Leu Asp Asp Glu Thr Trp lie 
310 315 320 32S 



1016 



gtg cac aac tac gga cac tct gga tgg ggt tac cag ggt teg tat ggt 
Val His Aarx Tyr Gly His Ser Gly Trp Gly Tyr Gin Gly Ser Tyr Gly 

330 335 340 



1064 



10 tgt get gag aat gta gtc cag ttg get gac aag gtc gge aag gcg gee 
Cys Ala Glu Aan Val Val Gin Leu Val Asp Lys Val Gly Lys Ala Ala 

345 350 355 



11X2 



aag tct aag ctg tagttgaaaa ggectgaatg agtaatagta attggatatt 

■ 

15 Lys Ser Lys Leu 

360 



1164 



ggaaataccg tatttgccct eg 



1166 



20 <210> 6 

<21X> 361 
<212> PKT 

<:213> Mectria haematococca 

♦ 

25 <400> 6 

Met Ser Asn Thr lie Val Val Val Gly Ala Gly Val lie Gly Leu Thr 
15 10 IS 

Ser Ala Leu Leu Leu Ser Lys Asn Lys Gly Asn Lys lie Thr Val Val 
30 20 25 30 

Ala Lys His Wet Pro Gly Asp Tyr Asp Val Glu Tyr Ala Sex Pro Phe 
35 40 " 45 

35 Ala Gly Ala Asn His Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp 
50 55 60 

Glu Arg Arg Thr Trp Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro 
65 70 75 BO 



40 



Glu Ala Gly Val His Phe Gin Lys Ser Arg lie Gin Arg Arg Asn Val 

65 90 95 



Asp Thr Glu Lys Ala Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser 
45 100 105 110 
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Lys Glu Pro Trp Phe Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His 
115 120 125 

5 Pro Ser Glu Val lie Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser 
130 135 140 

Val Cys lie Asn Thr Ala lie Tyr Leu Pro Trp Leu Leu Gly Gin Cys 
145 150 155 160 

10 

lie Lys Asn Gly Val lie Val Lys Arg Ala lie Leu Asn Asp He Ser 

165 170 175 

Glu Ala Lys Lys Leu Ser His Ala Gly Lys Thr Pro Asn He He Val 
1S 180 185 190 

Asn Ala Thr Gly Leu Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys 
195 200 205 

20 Thr Met Ala Pro Ala Arg Gly Gin He Val Val Val Arg Asn Glu Ser 
210 21S 220 

Ser Pro Met Leu Leu Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val 
225 230 235 240 

25 

Met Tyr Leu Met Gin Arg Ala Ala Gly Gly Gly Thr He Leu Gly Gly 

245 250- 255 

Thr Tyr Asp Val Gly Asn Trp Glu Ser Gin Pro Asp Pro Asn He Ala 
30 260 265 270 

Asn Arg He Met Gin Arg He Val Glu Val Arg Pro Glu He Ala Asn 
275 2B0 285 

35 

Gly Lys Gly Val Lys Gly Leu Ser Val He Arg His Ala Val Gly Met 
290 295 300 

Arg Pro Trp Arg Lys Asp Gly Val Arg He Glu Glu Glu Lys Leu Asp 
40 305 310 315 320 

Asp Glu Thr Trp He Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr 

325 330 335 



45 



! 



900D b/xna Eib dsya 



01 t7002-abL. 



BASF Plant Science GmbH 

13 

Gin Gly Ser Tyr Gly Cys Ala Glu Asn Val Val Gin Leu Val Asp Iiys 

340 345 350 

Val Gly Lys Ala Ala Lys Ser Lys Leu 

360 




• 


355 


<210> 


1 


<211> 


10^1 


<212> 




<213> 


Trigonopsis 


<:220> 




<:221> 


CDS 


<222> 


(1) . . (1068) 


<223> 




<:400> 


7 



atg get aaa ate gtt gtt att ggt gac ggt gtt gee ggt tta act aca 
Met Ala Lys He Val Val He Gly Ala Gly Val Ala Gly Lsu Thr Thr 
20 1 S 10 IS 

get etc caa ctt etx cgt aaa gga eat gag gtt aca att gtg tec gag 
Ala Leu Gin Leu Leu Axg Lys Gly His Glu Val Thr He Val Ser Glu 

20 25 3D 

25 

ttt acg ccc ggt gat ctt agt ate gga tat acc teg cct tgg gca ggt 
Phe Thr Pro Gly Asp Leu Ser He Gly Tyr Thr Ser Pro Xrp Ala Gly 
35 . 40 45 

30 gee aac tgg etc aea tzZ tac gat gga ggc aag tta gcc gac tac gat 
Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
50 53 60 

gcc gtc tct tat cct ate ttg cga gag ctg get cga age age ecc gag 
35 Ala Val Ser Tyr Pro He Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 73 flO 

get gga att cga etc ate age caa cgc tec cat gtt etc aag cgt gat 
Ala Gly He Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 
40 85 90 95 

ctt cct aaa ctg gaa gtt gee atg teg gcc ate tgr caa cgc aat ccc 

Leu Pro Lys LeU Glu Val Ala Met Ser Ala He Cys Gin Arg Asn Pro 

100 105 

45 



48 



96 



144 



192 



240 



288 



336 
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cat cga ate ctg tct aga gee etc gac cga ttc ccg gaa ctg acc aaa 
His Arg He Leu Sex Arg Ala Leu Asp Arg Phe ?ro Glu Leu Uhr Lys 
275 280 2B5 



384 



432 



480 



528 



14 

tgg ttc aaa aac aca gtc gat tct ttc gag att ate gag gac agg tec 
Trp Phe Lys Asn Thr Val Asp Ser Phe Glu He He Glu Asp Arg Ser 
115 120 125 

S agg att gtc eac gat gat gtg get tac eta gtc gaa ttt cgt tec gtt 
Arg He Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 
130 135 140 

tgt ate eac ace gga gtc tac ttg aac tgg ctg atg tec caa tgc tta 
10 Cys He Hia Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 
145 150 155 160 

teg etc ggc gcc acg gtg gtt aaa cgt cga gtg aac cat ate aag gat 
Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His He Lys Asp 
15 165 170 175 

gcc aat tta eta cae tee tea gga tea cgc ccc gac gtg att gtc aac 576 

Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val He Val Asn 

IBO 1B5 190 

20 

tgt agt ggt etc ttt gcc egg ttc ttg gga ggc gtc gag gac aag aag 
Cys Ser Gly Leu Pha Ala Arg Phe Leu Gly Gly Val Glu A^p Lys Lys 

135 200 205 

25 atg tac cct att cga gga caa gtc gtc ctt gtt cga aac tct ctt cct 
Met Tyr Pro He Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 
210 215 220 

tct atg gcc tec ttt tec age act cct gaa aaa gaa aat gaa gac gaa 
30 Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 
225 230 235 240 

get eta tat ate atg ace cga etc gat ggt act tct ate att ggc ggt 
Ala Leu Tyr He Met Thr Arg Phe Asp Gly Thr Ser He He Gly Gly 
35 245 250 255 

tgt ttc caa ccc aac aac tgg tea tec gaa ccc gat cct tec etc ace 
Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 

260 265 270 



624 



672 



720 



768 



S16 



664 



10 



30 



45 
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IS 

gat ggc cct ctt gac att gtg cgc gaa tgc gtt ggc csac cgt act ggt 912 

Asp Gly Pro Leu Asp lie Vai Arg 61u Cys Val Gly His Arg Pro Gly 

290 295 300 

« 

aga gag ggc ggc ccc cga gta gaa tta gag aag ate ccc ggc gtfc ggc 960 
Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys He Pro Gly Val Gly 
305 310 315 320 



fctt gtt gtc cat aac tat ggt gcc gcc ggt get ggt zac caa tec tct 
Phe Val Val ais Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

323 330 335 



Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr He Val Ser Glu 

20 25 30 



Phe Thr Pro Gly Asp Leu Ser He Gly Tyr Thx Ser Pro Trp Ala Gly 
35 35 40 45 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
50 55 60 

40 Ala Val Ser Tyr Pro He Leu Arg Glu Leu Ala Arg flex Ser Pro Glu 
65 70 75 80 



Ala Gly He Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 

85 90 95 



1008 



tac ggc atg get gat gaa get gtt tct tac gtc gaa aga get ctt act 1036 
Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 
IS ' 340 345 350 

M 

cgt cca aac ctt tag 1071 
Arg Pro Asn Leu 
355 

20 

<210> 8 
<211> 356 
<212> PRT 
25 <213> Trigonopsis variabilis 

<400> 6 

■ 

Met Ala Lys He Val Val He Gly Ala Gly Val Ala Gly Leu Thr Thr 
1 5 IQ 15 



SweTree Technologies AB 20040207 PFSS443EP 
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16 



Xeu Pro Lys Leu Giu Vai Ala Met Ser Ala lie Cys 61n Arg Asn 



100 105 



110 



T^TP Phe Lys Asn Thr Val A3p Sex Phe Glu He He Glu A^p Arg Sex 

120 

Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Sex Val 
"° 135 



10 cys 11. Hi. Thr Gly Val Tyr Leu Asn Trp L.u Met Ser 61n Cys Leu 

ISO 



145 



15 



sex Leu Gly Ala Thx Val Val Lys Arg Arg Val Asn His He Lys Asp 
Ala Asn Leu Leu His Sex Sex Gly Ser Arg Pro Asp Val He 



180 185 



Val Asn 
190 



cys ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Ly. 



200 



205 



Met Tyr Pro He Arg Gly Sin Val Val Leu VaI n-,, - 

-—-a JJBU r. 



210 21S 



220 



2S Phe Met: Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 

230 235 



240 



Ala Leu Tyr He Met Thr Arg Phe Asp Gly Thr Ser He He Gly Gly 
30 255 



Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 

270 



260 265 



His Arg He Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
35 275 ^ 



260 



285 



Asp Gly Pro Leu Asp He Val Arg Glu Cys Val Gly His Arg Pro Gly 

295 

40 Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys He Pro Gly Val Gly 

310 -aiR ^ 

315 320 



4S 



Phe Val val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 335 



•S BESS 129 et?+ 



9003 b/XTtiO ay dSdB 
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Tyr GXy Met Ala Aap Glu Ala Val Ser Tyr Val Slu Arg Ala Leu Thr 

340 345 

« 

Arg Pro Asn Leu 
5 355 



10 



15 



2G 



30 



35 



40 



45 



<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 
<223> 



1047 
Dt!lA 

SchizosaGcharouiyces pombe 



CDS 

(22) , . (1041) 
coding for DJ\AO 



<400i> 9 

atgactaagg aaaataagcc a aga gat att gtc ate gtt ggc get ggc gtt 

Arg Asp lie Val He Val Gly Ala Gly Val 

att gga ttg acc act get tgg att ctt tea gac ttg ggt ctt get cct 
He Gly Leu Thr Thr Ala Trp He Leu Ser Asp Leu Gly Leu Ala Pro 

15 20 25 

cgt att aag gtg att gee aag tat aeg cct gaa gat cgt tet gta gaa 
Arg He Lys Val He Ala Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu 

30 35 40 

tac aet tec cct tgg get ggc gca aat ttc tgt age att tct get act 
Tyr Thr Ser Pro Trp Ala Gly Ala Asn Phe Cys Ser He Ser Ala Thr 

50 55 

gat gac aat get ttg cgc tgg gat aaa ate act tac cat cgt ttc gee 
Asp Asp Asn Ala Leu Arg Trp Aap Lys He Thr Tyr His Arg Phe Ala 
fiO €5 ' 70 

tac ttg gcg aaa aet cgt cct gaa gca gga ate cgc ttt get gat etc 
Tyr Leu Ala Lys Thr Arg Pro Glu Ala Gly He Arg Phe Ala Asp Leu 

"^^ BO ae; 

cga gaa ttg tgg gag tac gag ccg aaa cac gac aaa ate aga tec tgg 

Arg Glu Leu Trp Glu Tyr Glu Pro Lys His Asp Lys He Arg Ser Trp 

100 105 



51 



99 



147 



195 



243 



291 



339 



9000 ijyxrm sb dsua 



10 
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aat acc tat gtc aga gat ttc aaa gtt ate cct gaa aaa gat ett cca 387 
Asn Thx Tyr Val Arg Asp Phe Lys Val He Pro Glu Lys Asp Leu Pro 

110 115 120 

gga gaa cgt ate tac gga cat aag gcc acc acc ttt tta ate aac get 435 
Gly Glu cys He Tyr Gly His Lys Ala Thr Thr Phe Leu He Asn Ala 

130 135 

cct cat tac ttg aat tat atg tac aag ctg etc att gaa get ggc gtc 483 
Pro His Tyr Leu Asn Tyr Met Tyr Lys Leu Leu He Glu Ala Gly Val 
140 145 150 



gaa ttt gaa aag aaa gaa ttg agt cac ate aaa gag act gtc gaa gaa 
Glu Phe Glu Lys Lys Glu Leu Ser His He Lys Glu Thx Val Glu Glu 
15 1S5 160 165 170 



20 



25 



30 



aaa ttg ggt ggc gtt gaa gac ccg gac gtt tat ccg act cgt gga cat 
Lys Leu Gly Gly Val Glu Asp Pro Asp Val Tyr Pro Thr Arg Gly His 

190 195 200 



531 



act cca gaa get tea gta gta ttt aat tgc act ggt etc tgg get tec 579 
Thr Pro Glu Ala Ser Val Val Phe Asn Cys Thr Gly Leu Trp Ala Ser 

175 180 IBS 



627 



gtt gtt ttg gtt aag get cct cat gta aca gaa act cgc ett ttg aat 675 
Val Val Leu Val Lys Ala Pro His Val Thr Glu Thr Arg He Leu Asm 
205 210 215 

ggc aag aac tct gat acc tat att att cct cgt ccc tea aat ggt gga 723 
Gly Lys Asn Ser Asp Thr Tyr He lie Pro Arg Pro Leu Asn Gly Gly 
220 22s 230 



gtc att cgc ggc ggt ttc atg caa cea gga aac tgg gat cgt gaa att 771 
Val He Cys Gly Gly Phe Met Gin Pro Gly Asn Trp Asp Arg Glu He 
3S 235 240 245 250 

cac cct gaa gac act ttg gat ate ctt aag aga aca teg get ttg arg BIB 
His Pro Glu Asp Thr Leu Asp He Leu Lys Arg Thr Ser Ala Leu Met 

255 260 265 

40 

cca gaa ttg ttc cac ggc aag ggt ccg gag ggt get gaa att att caa 8 67 

Pro Glu Leu Phe His Gly Lys Gly Pro Glu Gly Ala Glu He He Gin 

270 275 280 

46 



9003 u/xra au dsba 



10 



15 
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gaa tgt gtc gga ttc cgt cct tct cga aag ggt ggt gcc cgc gta gag 91^ 
Glu Cys Val Gly Phe Arg Pro Ser Arg Lys Gly Gly Ala Arg Val Glu 
285 290 295 



ctt gar gtt gtt ccc ggc acc tea gtc ecc cct gtt cat gat tac ggt 
Leu Aap Val- Val Pro Gly Thr Sex Val Pro L,.u Val His Asp Tyr Gly 
300 305 

get tct ggc aca gga tac eaa get ggt tat ggt atg get ctt gac tct 
Ala Ser Gly Thr Gly Tyr Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser 
3" 320 32S 

gtc atg ttg get cct cct aaa ate aaa ttg gcttag 
Val Met Leu Ala l»eu Pro Lys lie Lya Leu 

335 



963 



1011 



1047 



<210> 10 

<:211> 340 

20 <212> PRT 

"^213> Schizasaccharomyces pombe 

<400> 10 

Arg Asp He Val He Val Gly Ala Gly Val He Gly Leu Thx Thr Ala 
2S 1 5 10 ' 15 

Trp He Leu Ser Asp Leu Gly Leu Ala Pro Arg He Lys Val He Ala 

20 25 30 

30 Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu Tyr Thr Ser Pro Trp Ala 

40 45 

Gly Ala Asn Phe Cys Ser He Ser Ala Thr Asp Asp Aan Ala Leu Arg 
50 SS 60 

35 

Trp Asp Lys He Thr Tyr His Arg Phe Ala Tyr Leu Ala Lys Thr Arg 
" 70 7s 80 

Pro Glu Ala Gly He Arg Phe Ala Asp Leu Arg Glu Leu Trp Glu Tyr 

as 90 95 

Glu Pro Lys His Asp Lys He Arg Ser Trp Asn Thr Tyr Val Arg Asp 

100 105 

45 • 



BS^ST 'S 



8£SgS IE9 617+ 



9003 bxXrO 30 dsoa 



TT:iT t7002-iftJW 



6W]*d 2eo:'-»u-^diii3 sua m£/^L\:^]^' m 
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20 

Phe Lys Val lie Pro Glu Lys Asp Leu Pro Gly Glu Cys lie Tyr GXy 

120 123 

His Lys Ala Thr Thr Phe Leu He Aan Ala Pro Hia Tyr Leu Asti Tyr 
^ 130 135 140 

Mer Tyr Lys Leu Leu He Glu Ala Gly Val Glu Phe Glu Lya Lys Glu 

ISO 155 160 

10 Leu Ser His He Lys Glu Thr Val Glu Glu Thr Pro Glu Ala Ser Val 

16S 170 



15 



Val Phe Asn Cys Thr Gly Leu Trp Ala Ser Lys Leu Gly Gly Val Glu 

180 18S 190 

Aap Pro Asp Val Tyr Pro Thr Arg Gly His Val Val Leu Val Lya Ala 

200 20S 



Pro His Val Thr Glu Thr Arg He Leu Asn Gly Lys Asn Ser Asp Thr 
20 210 215 



220 



Tyr He He Pro Arg Pro Leu Asn Glv gL'u vai Tie. r„s /=i 
225 230 235 



240 



25 



Met Gin Pro Gly Asn Trp Aap Arg Glu He His Pro Glu Asp Thr Leu 

2«S 250 255 



30 



Asp He Leu Lya Arg Thr Ser Ala Leu Mfet Pro Glu Leu Phe His Gly 

260 265 270 

Lys Gly Pro Glu Gly Ala Glu He He Gin Glu Cys Val Gly Phe Arg 
275 280 285 



35 Pro Ser Arg Lys Gly Gly Ala Arg Val Glu Leu Asp Val Val Pro Gly 
290 295 300 



40 



Thr Ser Val Pro Leu Val Hia Asp Tyr Gly Ala Ser Gly Thr Gly Tyr 

310 31S 320 

Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser Val Met Leu Ala Leu Pro 

325 330 335 



Lys He Lys Leu 
46 340 



0SX9T*S 
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<210> 
<211> 
<212> 
<213> 



21 



11 
DMA 

Snreptoinyces coelicolor 



10 



15 



20 



25 



<r22Qi> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



CDS 

(31) , . (95-?) 
coding for DAAO 



mis cofeature 
(B80) (936) 
DAAO Signature 



<400> 11 

gtggaaaccg aactggatga 



cgagcgggac ggc gaa gtc gtc gtg gtc ggc ggc 

Gly Glu Val Val Val Val Gly Gly 
1 5 



TOg gtg ate ggg ctg acg acg gcc gtc gtc etc gcc gag egg ggc aga 
Gly Val lie Gly Leu Thr Thr Ala Val Val Leu Ala Glu Arg Gly Arg 
10 15 20 

gtg egg ctg tgg ace egg gag ccc gcg gag egg ace ace teg gtg 
Arg Val Arg Leu Trp Thr Arg Glu Pro Ala Glu Axg Thr Thr Ser Val 
25 30 35 40 



54 



102 



150 



30 



gta gcg ggc ggg ctg tgg tgg ccg tac egg ate gag ccg gtc geg ctg 
Val Ala Gly Gly Leu Trp Trp Pro Tyr Arg lie filu Pro Val Ala Leu 

45 50 55 



19B 



35 



gcc cag gcc tgg gcg ctg cgt tec ctg gae gtg tac gag gag ctg gcg 
Ala Gin Ala Trp Ala Leu Arg Ser Leu Asp Val Tyr Glu Glu Leu Ala 

60 65 70 



246 



gca egg ccc ggg cag aec ggc gta cgc atg etc gaa ggg gtg etc ggc 294 
Ala Arg Pro Gly Gin Thr Gly Val Arg Met Leu Glu Gly Val Leu Gly 
75 80 85 

40 

gag aec ggc ctg gac gag gtg gae ggg tgg gee gcg gee egg ctg ccg 342 
Glu Thr "Gly Leu Asp Glu Val Asp Gly Trp Ala Ala Ala Arg Leu Pro 
90 95 100 

45 ggg etg cgc gcg gcg age gcc gcc gag tac gcc ggg acg ggg ctg tgg ' 390 



9003 b/xra ab dsba 
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Gly Leu Arg Ala Ala Ser Ala Ala Giu Tyr Ala Gly Thr Gly Leu Trp 
lOS 110 115 1^0 

gcg egg ctg ccg etc ate gac atg teg acc cat ctg ccg tgg ctg egg 
Ala Arg Leu Pro Leu He Asp Met Ser Thr His Leu Pro Trp Leu Arg 

125 130 



438 



10 



15 



20 



25 



30 



gag egg ctg ctg gee gcg ggc ggc aeg gtg gag gac cgc gcg gtg acc 
Glu Arg Leu Leu Ala Ala Gly Gly Thr Val Glu Asp Arg Ala Val Thr 

140 145 150 

gat etg gcc gag gcg gac gcg ccg gtg gtg gtc aac tgc acc ggc ctg 
Asp Leu Ala Glu Ala Asp Ala Pro Val Val Val Asn Cya Thr- Gly Leu 
153 160 . 165 

ggc gcc egg gag ctg gtg ccg gac ccg gcg gta egg ccg gtg cgc gga 
Gly Ala Arg Glu Leu Val Pro Asp pro Ala Val Arg Pro Val Arg Gly 
170 17S 130 

cag ctg gtc gtc gtg gag aac ccc ggc ate cac aac tgg ctg gtc gcg 
Gin Leu Val Val Val Glu Asn Pro Gly He His Asn Trp Leu Val Ala 
165 190 195 200 

gcc gac gcg gac tee ggg gag aeg acg tac tte ctt ccg cag ccg gga 
Ala Asp Ala Asp Ser Gly Glu Thr Thr Tyr Phe Leu Pro Gin Pro Gly 

205 210 215 

egg etc ctg ctg ggc ggc acg get gag gag gac gcc tgg teg acc gag 
Arg Leu Leu Leu Gly Gly Thr Ala Glu Glu Asp Ala Trp Ser Thr Glu 

220 225 230 



486 



534 



582 



630 



678 



726 



35 



40 



ccg gac ccg gag gtc gcg gcg gcc ate gtg cga egg tgc gcg gcc ctg 
Pro Asp Pro Glu Val Ala Ala Ala He Val Arg Arg Cya Ala Ala Leu 
235 240 245 

cgt cee gag ate gcc gga gcg egg grg etc gcg cac ctg gtg ggg ctg 
Arg Pro Glu He Ala Gly Ala Arg Val Leu Ala His Leu Val Gly Leu 
250 255 260 

egg ccg gcc egg gac gcg gtc egg ctg gag cgc ggg acg ctg ccg gac 
Arg Pro Ala Arg Asp Ala Val Arg Leu Glu Arg Gly Thr Leu Pro Asp 
265 270 275 280 



ggg cgc egg ctg gtg cac aac tac ggt cac ggc ggc gcg ggc gtc acc 
45 Gly Arg Arg Leu Val His Asn Tyr Gly His Gly Gly Ala Gly Val Thr 



774 



S22 



870 



die 



2£yBT "S 



B£S2S T29 6t?+ 



900D b/xna 3b dsoa 



El: LI t?00S-abl 



nam science GmbH 

285 



sua l700S/80/aui92"j-diin 



23 

290 



295 



gtg gcc tgg ggc tgc get cag gag gcg gcc egg etc gcc ccctga 
Val Ala Trp Gly cys Ala Gin Glu Ala Ala Arg Leu Ala 

300 305 



963 



<210> 12 

<211> 309 

10 <212> PRT 

<213> Streptomyces coelicolor 

<220> 

<221> misc^f eatiire 

15 <222> (880) --(936) 

<223> DAAO signature 

<:400> 12 

Gly Glu val Val Val Val Gly Gly Gly Val He Gly Leu Thr Thr Ala 

10 15 

Val Val Leu Ala Glu Arg tSly Axg Arg Val Arg Leu Trp Thr Arg Glu 

20 25 30 

25 Pro Ala Glu Arg Thr Thr Ser Val Val Ala Gly Gly Leu Trp Trp Pro 

35 40 45 ' 

Tyr Arg lie Glu Pro Val Ala Leu Ala Gin Ala Trp Ala Leu Arg Ser 
50 55 60 

30 

Leu Asp Val Tyr Glu Glu Leu Ala Ala Arg Pro Gly Gin Thr Gly Val 

•'0 75 eo 

Arg Met Leu Glu Gly Val Leu Gly Glu Thr Gly Leu Asp Glu Val Asp 
35 85 90 95 

Gly Trp Ala Ala Ala Arg Leu Pro Gly Lau Arg Ala Ala Ser Ala Ala 

105 

40 Glu Tyr Ala Gly Thr Gly Leu Trp Ala Arg Leu Pro Leu He Asp Met 

120 125 

Ser Thr His Leu Pro Trp Leu Arg Glu Arg Leu Leu Ala Ala Gly Gly 
130 13S 140 

45 



tSASF Plant Science GmbH 



21 : a mzm/LVA]^^' m 



24 



Thr Val Glu Asp Arg Ala Val Thr Asp Leu Ala Glu Ala Asp Ala E»ro 

ISO 155 

Val Val Val Asn Cys Thr Gly Leu Gly Ala Arg Glu Leu Val Pro Asp 
5 165 17Q 175 

Pro Ala Val Arg Pro Val Arg Gly Gin Leu Val Val Val Glu Asn Pra 

IBO leS 190 

10 Gly lie His Asn Trp Leu Val Ala Ala Asp Ala Asp Ser Gly Glu Thr 

195 200 205 



15 



Thr Tyr Phe Leu Pro Gin Pro Gly Arg Leu Leu Leu Gly Gly Thr Ala 
210 215 220 

Glu Glu Asp Ala Trp Ser Thr Glu Pro Asp Pro Glu Val Ala Ala Ala 
^25 230 235 240 



He Val Arg Arg Cys Ala Ala Leu Arg Pro Glu He Ala Gly Ala Arg 
20 245 250 



255 



Val Leu Ala His Leu Val Gly Leu Arg Pro Ala Arg Asp Ala Val Arg 



2 / u 



25 Leu Glu Arg Gly Thr Leu Pro Asp Gly Arg Arg Leu Val His Asn Tyr 

275 280 285 



30 



Gly His Gly Gly Ala Gly Val Thr Val Ala Trp Gly Cys Ala Gin Glu 
290 295 300 

Ala Ala Arg Leu Ala 
305 



<210> 13 

35 <211> .1038 

<212> DNA 

<213> Candida boidinii 



40 



<220> 
<221> 
<:222> 
<223> 



CDS 

(1) . . (1035) 
coding for DAAO 



45 



900D fci/xna ab dsua 



2T :il 



8Wd 2S0:"-«u-:^diig 2i:a t?002/80/ii:HB2-|du 
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25 

<400> 13 

atg ggt gat c&a att gtt gtt ctt ggt tec ggt att att ggt cca tat 48 
Mec Gly Asp Gin lie Val Val Leu Giy Ser Gly lie lie Gly Leu Tyr 
I $ 10 15 



act aca tac tgt tta ate tat gag get gga tgt get cca get aaa att 
Thr Thr Tyr Cys Leu He Tyr Giu Ala Gly Cys Ala Pro Ala Lys He 

20 25 30 



act gaa tat tgg gat ttt tat cct ggt gat gaa aaa gtc aat tct tta 
Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 105 110 



96 



act att gtt get gaa ttt nta cca ggt gat caa tct aca tta tat aca 144 
Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 

tct cca tgg gca ggt ggt aat ttt tct tgt att tea cca get gat gat 192 
Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys He Ser Pro Ala Asp Asp 
50 55 6b 

■ 

■ 

aca aca ttg gcc tat gat aaa ttc aca tat ctt aat tta ttc aag att 240 
Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys lie 
65 70 75 80 

cac aaa aaa tta ggt gga cca gaa tgt gga tta gat aat aag cca agt 288 
His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 

85 * 90 95 



336 



aaa caa tat ctt aaa gat ttt aaa gtt att cca aaa tea gaa tta cca 384 
Lys Gin Tyr Leu Lys Asp Phe Lys Val He Pro Lys Ser Glu Leu Pro 
115 120 125 



gaa ggt gtt gaa tat ggt att agt tat act aca hgg aat ttc aac tgt 432 
Glu Gly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe Asn Cys 
130 135 140 

cct gtt ttc tta caa aat atg get aat ttt tta aat aaa aga aat gtt 480 
Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
14S 150 155 160 

acc att att aga aaa cat tta aca cat att tct caa get tat tta aca 528 
Thr He He Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 

165 170 175 



8£S2S IS9 6t7+ 



900D d^xra 3d dsba 
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m'6 2S0:"-»u-^dui3 

BASF Plant Science GmbH 



ma i?oo3/8o/a:u^2-^dug 



26 



gtt aat aca aaa gtt gtt ttc aac cgL aca ggt att ggc gcr gcr gat 
Val Asn Thr Lys Val Val Phe Asn Cys Thar Gly lie Giy Ala Ala Asp 

ISO ISS 190 



57S 



tta ggt ggt gtt aaa gat gaa aaa gtt tat cca act aga gga caa gtc 
Leu Gly Gly Val I^ys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 



624 



gtt gtt gtt aga get cca cat att caa gaa aat aaa atg aga tgg ggt 
10 Val Val Val Arg Ala Pro His lie Gin Glu Asn Lys Met Arg Trp Gly 
210 215 220 



672 



IS 



aaa gac tat get act tat att att cca aga cca tat tct aat ggt gaa 
Lys Asp Tyr Ala Thr Tyr lie lie Pro Arg Pro Tyr Ser Asn Gly Glu 
225 230 235 240 



720 



20 



tta gtc tta ggt ggt ttc tta caa aag gat aat Egg aca ggt aat act 

Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

ttt ggt ttt gaa act gat gat att gtt agt aga act aca tct tta tta 

Phe Gly Phe Glu Thr Asp Asp lie Val Ser Arg Thr Thr Ser Leu Leu 

260 265 270 



768 



816 



cca aag att tta gat gaa cca ett cat att att aga gtt gca get ggt 
Pro Lys lie Leu Asp Glu Pro Leu His lie lie Arg Val Ala Ala Gly 
275 280 285 



8 64 



eta aga cca agt aga cat ggt ggt cca aga att gaa get gaa get tgt 
30 Leu Arg Pro Ser Arg His Gly Gly Pro Arg lie Glu Ala Glu Val Cys 
290 295 300 



912 



gaa gaa ggt aaa tta act att cat aat tat ggt get tct gga tat ggt 
Glu Glu Gly Lys Leu Thr lie His Asn Tyr Gly Ala Ser Gly Tyr Gly 

305 310 315 aao 



960 



40 



tat caa get ggt tat ggt atg tct tat gaa get gtc aaa ctt tta gtt 
Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 

gat aac caa aaa gtt aaa get aaa ctt tag 
Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 346 



lOOB 



1038 



45 



0S/22-S 



8£S2S T29 SP-i- 



900D u/xna ad dsba 
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<210> 14 
<211> 345 
<212> PRT 

<213> Candida boidinii 

5 

<AQQ^ 14 

Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie lie Gly Leu Tyr 
1 » 5 10 15 

10 Thr Thr Tyr Cys Leu lie Tyr Glu Ala Gly Cys Ala Pro Ala Lys 11© 

20 25 30 

Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 

15 

Ser Pro Trp Ala Gly Gly Asn Phe fier Cys lie Ser Pro Ala Asp Asp 
SO 55 60 

Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys He . 
20 65 70 75 ao 

His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 

65 90 95 

25 Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 105 110 

Lys Gin Tyr Leu Lys Asp Phe Lys Val He Pro Lys Ser Glu Leu Pro 
115 120 125 

30 

Glu Sly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe Asn Cys 
130 135 140 

Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Aan Lys Arg Asn Val 
3S 145 150 155 160 

Thr He He Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 

165 170 175 

40 Val Asn Thr Lys Val Val Phe Asn Cya Thr Gly He Gly Ala Ala Asp 

180 185 190 

Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 

195 200 205 

45 



900D y/xra 3b dsoa 
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Val Vai Val Arg Ala Pro His He Gin Glu Asn Lys Met Arg Trp Gly 
210 215 220 

Lys Asp Tyr Ala Thr Tyr He He Pro Arg Pro Tyr Ser Asn Gly Glu 
5 225 230 235 240 



Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

10 

Phe Gly Phe Glu Thr Asp Asp He Val Ser Arg Thr Thr Ser Leu Leu 

260 265 270 

Pro Lys He Leu Asp Glu Pro Leu His He He Arg Val Ala Ala Gly 
1S 275 280 285 

Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

20 Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
305 310 315 320 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Vai Lys Leu Leu Val 

325 330 335 

Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 345 



30 <210> 15 

<211> 12466 

<212> DVfi 

<212> vector daaoScelTetON 



35 



<220> 
<221> 
<222> 
<223> 



misc^feature 
(38) (183) 

Agrobacteriu&L right border 



40 



■<:220> 
<221> 
<222> 
<:223> 



xais c__f eatur e 
(445) (462) 

recognition / cleavage site for I-Scel endonuclease 
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<220> 

<221> terminator 

<222> (196} (400J 

<223> complementary: 35S terminator 



<220> 
<221> 
<222> 
<223i' 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 

<223> 

<220> 
<221> 
<222> 
<r223> 



misc^feature 
(SIS) . . (1222) 
compl eznent ary 



coding for I-Scel endonuci 



ease 



« 

promoter 
(1270) . . (ISfiO) 

complementary: coding for pTOPlOP teraeyclin 

t erminat or 
(1735) . . (1990) 

complementary: NOS terminator 



regulatable promoter 



25 



miac_feature 
(2067) (3173) 

complementary: coding for Rhodotorula gracilis Diamine acid oxida 



se 



<220> 
<221> 
<222> 
30 <223> 

<220> 
<221> 
<222> 
35 <223> 

<220> 
<221> 
<222> 
40 <223> 

<:220> 
<221> 
<222> 
4S <223> 



promoter 
(3217) . . (5028) 

complementary: Arabidopsis thaliana nitrilaa 



B I promoter 



terminator 
(5118) (5343) 

complementary: OCS terminator 



misc^feature 
(5418) (6425) 

complementary: coding for tetracyclin repressor rtTA 



promoter 
(6479) (7341) 
coznplement ary : 



coding for Pisum sat:i^nam ptxA promo t 
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ma t?oo^/80/a:H92•:^c^llg 

PF 55443 EP 



<220> 
<221> 
<222> 
<223> 



terminator 
(7345) . . (7549) 

complementary: coding for 23S terminator {functioning as homology 
sequence) 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(7618) . . (7B34) 
Agrobacterium left border 
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<400> 15 
aatattcaaa 
aaaccttttc 
tttacccgcc 
arcagacctg 
aagtatttta 
aaccetataa 
aatagagaga 
tgtctcacga 
tgatggatat 
tagtgttcgg 
agaccaggta 
aacagttcag 
tgaaagactg 
taccaccatc 
ggrtgttcgg 
cttggtgttt 
gttctttttt 
tgtatgcttt 
ggatgtaagc 
cgatcagctg 
ggtttttrtt 
cgttactagc 
cgaagactag 
aagtcgagtt 
cctarcagtg 
aagtgaaagt 
ccactcccta 
gagaaaagtg 
tagtggatca 
tctagtaaca 
ttttctahcg 
aataacgtca 
atgaxiaatca 



caaacacata 
acgccctttc 
aatatatcct 
gtacccggtc 
caaatacaaa 
gaaccctaat 
gatagatttg 
cgttttgaac 
ctgcagaatt 
cagtttgtac 
agacatiagaa 
ttggaatttg 
ggtgttcagt 
atc:eatgaac 
gatggttttt 
gaaagtctgg 
gtgcggcggg 
gtctttccac 
atcacccagg 
ggatttgtat 
gatgtttttc 
czcgggctcga 
atcactctat 
taocactccc 
atagagaaaa 
cgagtttacc 
tcagtgatag 
aaagtcgagc 
attcactggc 
tagatgacac 
cgtattaaat 
tgcattacat 
tcgcaagacc 



cagcgcgact 
aaatatccga 
gtcaaacact 
actggatttt 
tacatactaa 
•cccGttatct 
tagagagaga 
ccagattacc 
cgccctttta 
atoatctaco 
tcgatgcaga 
ttacgcagac 
acgarcgatt 
cagtatgcca 
ttgttgttaa 
gcgccecagg 
gacagtaccc 
tcgaactgca 
atcagaccga 
tctttcagca 
atatggtcga 
gcaaatgtct 
ctcgagttta 
tatcagtgac 
grgaaagtcg 
actccctatc 
agaaaatgaa 
tcggtaccga 
cgtcgtttta 
cgcgcgcgat 
gtataattgc 
gttaattafct 
ggcaacagga 



tatcatggac 
ttattctaat 
gatagtttaa 
ggttttagga 
gggrttctta 
gggaactact 
ctggtgattt 
ctgttatccc 
gatcttattt 
ggateaggta 
tgatcggttt 
ccttaaccag 
tgttggtaga 
gagacatcgg 
cgatgaaoag 
tgattaccag 
actgatcgta 
tacagtaggt 
.tacctgcttc 
gtttagagtt 
ctaaagggcg 
agaaaggcct 
ccactcccta 
agagaaaagt 
agtttaccac 
agtgatagag 
agtcgagttt 
gctcgaattc 
caacgactca 
aatttatcct 
gggactctaa 
acargcttaa 
ttcaatctta 



atacaaatgg 
aaacgccctt 
actgaaggcg 
attagaaatt 
tatgctcaac 
cacacattat 
ttgcgccggg 
tagtcgagcg 
caggaaagtt 



gtttttgttg 
gtattotact 
gtttttgttg 
ggtcaggtag 
gcragccagt 
gttacccagg 
cagcagacat 
tttaccttca 
gaactgttcg 
cggacccagg 
aattccagca 
tatatacgta 
tcagtgatag 
gaaagtcgag 
tccctatccg 
aaaagtgaaa 
acoactccct 
agcacactgg 
gagcttgaca 
agtttgcgcg 
tcataaaaac 
cgtaattcaa 
agaaacttta 



acgaacggat 6o 

ttctcttagg 120 

ggaaacgaca 180 

ttattgatag 240 

acatgagcga 300 

tctggagaaa 360 

taccccaaac 420 

gccgccagtg 480 

tcggaggaga 540 

-aggttgtaga 600 

atttttacgt 660 

tcttcgaaag 720 

caatcccatt 780 

ttttcaacca 840 

ttgttgaaag 900 

tggttaacac 960 

acgtggtcca 1020 

tcacgagaac 1080 

atgttcagtt 1140 

ttcattacct 1200 

cactggcggc 1260 

aagggtctcg 1320 

agaaaagtga 1380 

tttaccactc 1440 

tgatagagaa 1500 

gtcgagttta 1560 

atcagtgata 1620 

cggccgttac 16B0 

ggaggcccga 1740 

ctatattttg 1800 

ccatctcata IB 60 

cagaaactat 1920 

ttgccaaatg 1980 
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tttgaacgat 
agcaagatct 
cgctggaacg 
tatcccgcac 
S cgtgcgctgc 
cgrt:c1:gcct 
aggacctcga 
tgcttgagga 
tacgtcccgc 

10 gggtcggacg 
cggattggcc 
gcgttgacca 
rcaaacgtcg 
tggacggaga 

15 agggggcggt 
tgcgcgaacc 
cactccttga 
aaaggcgtcc 
ggcaagtcgc 

20 ctcagaccga 
cgactagagt 
tgacactcag 
cacgtttatt 
aggtgggact 

25 aaagtitcaca 
accagccgat 
aacggtggtt 
aactaccaaa 
agaatttatt 

30 taggt:aataa 
ttggtcgtac 
agtatttttt 
tgtcataaat 
gaaatgtgag 

3S gaattatctc 
ggtcaaatat 
attcctacaa 
ccaattagtt 
atacttcatt 

40 ctttcttact 
attgtcaatg 
tttctgattg 
acatatacgt 
acafcttttaa 

45 acagtaactt 



cggggatcat 

agagcttggg 

cc^tcgtcgac 

tcgagaagcc 

ccctgccgag 

caacgcgggg 

tgccttcgat 

tccgctggac 

cgcagatgac 

agtccatcgt 

cggcggcttg 

ccaaatcdgc 

cgccgagctt 

gggtgtcgta 

aatttggcgt 

gectcgtccc 

aagtcgattc 

aattcgcgcc 

gcgcgagaat 

taacgcctga 

cgagatccga 

agacttgaga 

ttcctattgg 

agcccacgtg 

ccaaacctac 

gctttatgtc 

tacaaagtat 

taaaaaacaa 

Cttttatgga 

taiiatacbat 

gattctacct 

ttrtggtttg 

aaaatttgac 

tttggctcct 

agagaaactg 

ctttggattt 

gagaaatcaa 

cgcafccaaga 

gcagctttca 

tgcaggaact 

aaacgaaata 

tgacagccat 

aatggtttag 

gaaaaataaa 

aalitattaaa 



ccgggtctgt 

ccccgcctac 

gagctgcgcg 

atacgcatgc 

cgagaggggc 

tccgcctcgt 

cgttccgncg 

cgtctctggg 

ttcgccacct 

gcatcgcttg 

gtcgtcgatg 

accgtcgaac 

ctgcagctct 

ggttacgccg 

gatgtccttg 

e:ttgagccac 

ttcccatttt 

agcccatgg-k 

atgcacgctg 

tccgaggaca 

tatcgcccgg 

gaagcaatat 

agacggtaac 

gtttcttttc 

1iaat:aa1it:ga 

tgaatiataaa 

ttggacacaa 

ataaaaaact 

ttccttaaat 

gttacagata 

aaraatattg 

tcttagctgt 

gtatagattt 

atttaatctt 

gattaaaraa 

actattattg 

aaiittttgaa 

aaaccaaacc 

ga-taaaagaa 

caaatcccac 

gtctttttat 

ccajiatatat 

cjatagcttct 

ttaatatatg 

cattctaaat 
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ggcgggaact 
aacttcgacst 
acatcctccg 
acaagcgtga 
gactttgtcc 
cgtgcaggtc 
ctcgagatgg 
ttgacagaca 
ggtcggggaa 
catggggact 
cccgcaatcg 
gcctgctcaa 
cttgcaaggx: 
at:agcgc£=ag 
taccagtgcc 
atggcatggc 
gcttgtcgag 
gaagcgaaag 
tagcccttcc 
acgacgcgct 
gctcgagbct 
atagaccttt 
gaagatcgaa 
tctgcattga 
gaagaaaaat 
aaagaagi:ga 
taaatccaaa 
tiaaaagaatt 
ccatatt.tga 
tgtttctaat 
aacggaattg 
atttcattaa 
tttaaafccca 
aggatgggtt 
actaaaaaat 
gcgaaaagaa 
ttaacatgga 
ccaattaata 
aaaaacacac 
tttgccactt 
gcaaacac^tg 
agggaatgta 
attttcgttg 
tatattctrg 
gcaaatatgc 
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ccacgaaaat 

cccgcgccgc 

ccgcgcccca 

cctccttccc 

ggtcgagagg 

gcaagccgac 

tcgggtcgag 

agccccagtc 

tgatgtaggc 

tgacgaggac 

acttggcgcG 

gcgacgtaac 

actggcagt.a 

gtggacattc 

cgagcaagcc 

ccgticgggac 

gaccgtctgt 

tctggctcga 

gagcgaggat 

tctgcgagtg 

ttgtttttta 

ttttgttttt 

cctgtggtgg 

tttgtttttg 

aaaatctatt 

aaaccccgrt 

cgaaataaaa 

tattccattt 

tgcattttga 

tcgtattaac 

atgttttgga 

atataaattt 

ttttatgtta 

aatactaagt 

agatcaatgt 

agtctcatat 

ttgtatagtt 

atcaaacggg 

actcaagtct 

ttctttacaa 

tttgtctttt 

aaacaacaac 

tcaatattag 

gaactaatgl; 

aaagaaaaaa 



atccgaacgc 2040 

gccgtggtac 2100 

actctgctgg 2160 

cttcgccgct 2220 

caggacgatc 2280 

gttgtggcgg 2340 

gcgcaagcag 2400 

tcccacgcrcg 24^0 

gggagaagcg 2520 

ggtttgcccg 2380 

aagtcccgna 2640 

ggtccgtctc 2700 

ctfctggtgcg 2750 

ggaagatggg 2820 

gtcttcgttc 2aa0 

caactcgacc 294 0 

aagcgtcatg 3000 

gacgt cc t cc 3060 

gagggcgctg 3120 

cat:gggccc t 3 1 B 0 

ctttggtitca 3240 

tttttgtggt 3300 

aaat gaaaca 3360 

ttttttttgt 3420 

gattgactaa 34B0 

taagaattac 3540 

caaaatggag 3600 

tttttcccgt 3660 

ttcctcataa 3720 

ctaccttctt 3780 

ccactcagaa 384 0 

aaataagaaa 3900 

tttaatattfc 3960 

tttccttaat 4020 

gtttfcggtcc 4080 

agt aaat cat 4140 

tcttatataa 4200 

cttggtagga 4260 

tttatttcat 4320 

ataaacacaa 4380 

ttcgatcacg 44 40 

atgtgaagtc 4 500 

tcattccaaa 4560 

atgiiggaaat 4 620 

aagaaaagaa 4 680 



900D b/XrS DB :JStia 



950' d 2S0:'J'J-:^dug 

sAsr fiant science GmbH 



cacaactgaa 
aacacaacat 
agagaattgg 
ttgaaactca 
5 atcacaaccg 
accggtttct 
ggtagggata 
aattcgagct 
aaataacaag 
10 attcagaaat 
gagtgcgata 
atcttgcgcc 
ccaggagagt 
tgctcaaact 
IS aatcccggac 
tcggcatgcg 
tcggtcgggg 
ggagccgcca 
atggtagacc 
20 ttaagttgtt 
gcaccttggt 
gtaggtgttt 
aaagtaaaa fc 
tggcataaaa 
25 tcrgaatgta 
ttattgcgta 
ctatctaaca 
tacagtgtag 
attccgacct 
30 gacatggtcg 
ttcgaagatt 
agaatgagtg 
tctttctcgt 
tttttatgtt 
35 atgaatteaa 
aatttgttta 
gacatttbtt 
aatcttaatc 
gtgcraaact 
40 gaaatgtgtc 
aatggaagaa 
taattgacaa 
ggtttcaaag 
tccttgaatg 
45 ctcagcttca 



atcaaagcca 
ccagaattga 
ttctcttcca 
acataacatt 
attiacttgtt 
tcgggtgctc 

acagggtaat 

cggtaccaar 

tcaggtatta 

atttcaataa 

ttatgtgtaa 

gggtsccgag 

cgttgatcta 

cgaagtcggG 

ccggggaatc 

ccatcgccac 

gggccgtcga 

gccccgcctc 

cgtaattgtt 

tttcstaatcc 

gatcaaataa 

ccctttcttc 



gattcataat 

acac1:caaat 

ttalitaacga 

ttgaactitga 

tgagtcatct 

atttacggtc 

ctgggtbcaa 

tcccatcttg 

tagtccaagc 

ctgattata't: 

tacataaatt 

ctcggtagca 

cccaccgtac 

catatccaga 

ccagtccccc 

gtcctcgccg 

cagtctgcgc 

ttcgggggcg 

tttcgtacgc 

gcatatgatc 

ttcgatagct 

tttagcgact 



32 

aattggctae 

tggatgatag 

aaacaaagaa 

ctccttcata 

tcc2gctttct 

aagtgtaaaa 

aacgtcgtga 

aaagaaatat 

aaaaacataa 

cagctggtac 

gatgatatag 

attcccgagg 

"tcgtcaattc 

gcgccgtagg 

aacatgtcca 

tctaagtgga 

gtgtgtcccg 

tcgtcgtccg 
gcgcggctgt 
aattcaaggc 
tgtcgtaata 
tgatgctctt 



atggttgtag 

atggataatc 

aaaaagttta 

ggagtgacat 

ccaecctcga 

catctggtct 

gacagtttgg 

agtttaaata 

atttattgat 

attgccgtag 

ctagct-bagc 

ctgtagccga 

caagggcatc 

gggcggagtc 

gatcgaaatc 

gctcgtcccc 

cggggagaaa 

ggagafccgag 

acgcggaccc 

cgaataagaa 

atggcggcat 

gatcttccaa 



aatgtagggt 

t:t:tagat:aca 

gcataaaagt 

gaactgacga 

aatgaatgtg 

cgaggtacct 

tgcaggtcga 

tttattgata 

gcaagtttaa 

atgaaagact: 

tcatcggggg 

cgatggtgcg 

ggtaaacatc 

gtggggggta 

gtctagcgcg 

caggctgaca 

ggacaggcgc 

caggccctcg 

ac^ttcacat 

ggctggctct 

actatcagra 

hacgcaacct 



aggctaattg 
cttttgctcc 
aaaaa-tcttg 
tctcaatggc 
gctgctctac 
cabiiaagcag 
atcgactcta 
ttagtgtaat 
atgaggttta 
gagtcatatt 
aaatcattga 
acaaactagg 
tttttttcat 
taacatgtag 
tttttttcat 
attggtaatt 
atcctaaaaa 
gcaatatgtt 
ctatgatttt 
catgttatgc 
gagggacaaa 
caaaaaattg 
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attrtcgaga 
attgcgatga 
ccagctttcc 
taaggcgtcg 
accaagcttc 
ctccaatgcg 

gactagtgga 

gtgtgtgctc 

tatggtgaaa 

cacggaccat 

ttgattttta 

ccagaaaaaa 

ggataagggg 

tgacaagtag 

aaatctgcac 

tctttttgta 

caacattttg 

gatgtttatt 

acattggatt 

tagtgafctct 

ctttgaatga 

cggacggtca 



taacgcgt tctutaytga aaaaccttgt 



gtttcatact 

cttagtaaag 

ccttttaaag 

agcaaagccc 

tgggcgagtt 

ctgttaatca 

tccgatatcg 

actactatga 

aaaactatga 

gttgcagcaa 

grgggcctcg 

ggatgtgggg 

aatcaatgga 

tgctattatt 

atgtgatgct 

atcgaatcaa 

ctagttttag 

gggtaaaaga 

tgatattcct 

tgrgtttgat 

cttttgaata 

ccggatttx:g 



gtttttctgt 

cacatctaaa 

ggcaaaagt g 

gcttattttt 

tacgggttgt 

ctttactttt 

cccgggcccg 

agctttgcac 

aattttgata 

attggaatta 

ttacatattc 

ccattttttt 

aaaagtttga 

cgacccgcga 

ttaattatgc 

gtatctttta 

actttgatgt 

aagggacttg 

aacattaatt 

gcttgaaaaa 

ggtgtaaaat 

gttttaggaa 



aggccgtgta 
acttttagcg 
agtatggtgc 

tacatgccaa 

taaacctticg 

atctaatcta 

actctagagt 

ttaaaaaaat 

ttttgatata 

aactattcat 

aagagttaga 

gtgtcttaag 

tgtactagag 

tgaaaggggc 

tttagacttt 

aactatgtat 

ttatatgctt 

attgagtatg 

taagtgtgtg 

tctacattca 

ccaatcctcc 

ttagaaattt 



4740 

^SOO 

4860 

4920 

49d0 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

54 60 

5520 

5560 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

61B0 

6240 

^300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

67B0 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 
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tattgataga 
catgagcgaa 
ctggagaaaa 
ctctagagtc 
S tgcttggtaa 
ttttagacaa 
ttaagttgtc 
cagctccccg 
ttgagtaaaa 
10 aaatacgcaa 
caagacigacc 
gttagtcgat 
accgctaacc 
ccggcgcgac 
IS gat;caaggca 
caccgccgac 
agcggccttt 
ggcgctggcc 
cccaggcact 
20 ccgcgaggtc 
aaagagaaaa 
agcaaggctg 
cagttgccgg 
attaccgagc 
25 aatgagtaga 
caccgacgcc 
ctgggttgtc 
agcggtcgca 
gagaagttga 
30 gtgaatcgtg 
ccggtgcgcc 
cgatgctcta 
gtctg-tcgaa 
acgtagaggt 
35 tgatggcggt: 
agcccggccg 
atggcggaaa 
ttgccatgca 
aagccttgat 
40 agatcgagct 
tgacggttea 
tggcacgccg 
gcagtggcag 
caaatgacct 
45 tcat:gcgc'ta 



ag-cattfctac 
accctatiaag 
atagagagag 
gaggtaccga 
taattgtcat 
gtatcaaacg 
taagcgtcaa 
accggcagcb 
cagcttgcgt 
ggggaacgca 
atcgcaaccc 
tccgatcccc 
gttgtcggca 
ttcgtagtga 
gccgacttcg 
ctggtggagc 
gtcgtgtcgc 
gggtacgagc 
gccgccgccg 
caggcgctgg 
tgagcaaaag 
caacgttggc 
cggaggatca 
tgctatctga 
tgaattttag 
gtggaatgcc 
tgccggccct 
aaccatccgg 
aggccgcgca 
gcaaggggcc 
g-tcgattagg 
tgacgtgggc 
gcgtgaccga 
ttccgcaggg 
fctcccatcta 
cgtgttccgt 
gcagaaagac 
gcgtacgaag 
tagccgctac. 
agctgattgg 
ccccgattac 
cgccgcaggc 
cgccggagag 
gccggagtac 
ccgcaacctg 



aaatacaaar 
aacccraatt 
atagatttgt 
gctcgaattc 
tagattgttr 
gatgttaact 
tttgtttaca 
cggcacaaaa 
catgcggtcg 
tgaaggttat 
atctagcccg 
agggcagtgc 
tcgaccgccc 
tcgacggagc 
tgctgattcc 
tggttaagca 
gggcgatcaa 
fcgcccattct 
gcacaacGgt 
ccgctgaaat 
cacaaacacg 
cagcctggca 
caccaagctg 
atacatcgcg 
cggctaaagg 
ccatgcgtgg 
gcaatggcac 
cccggtacaa 
ggccgcccag 
gctgatcgaa 
aagccgccca 
acccgcgata 
cgagctggcg 
ccggccggca 
accgaatcca 
ccacacgttg 
gacctggtag 
aaggccaaga 
aagaticgtaa 
atgtaccgcg 
tttttgatcg 
aaggcagaag 
ttcaagaagt 
gatttgaagg 
atcgagggcg 
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acatactaag 
cccttatctg 
agagagagae 
actggccgtc 
ttatgcaliag 
cagtacatta 
ccacaatata 
tcaccacgcg 
ctgcgtatat 
cgctgtactt 
cgccctgcsaa 
ccgcgattgg 
gacgattgac 
gccccaggcg 
ggtgcagcca 
gcgcattgag 
aggcacgcgc 
tgagtcccgt 
tcttgaatca 
taaatcaaaa 
ctaagtgccg 
gacacgccag 
aagatgtacg 
eagctaccag 
aggcggcatg 
aggaacgggc 
tggaaccccc 
atcggcgcgg 
cggcaacgca 
tccgcaaaga 
agggcgacga 
grcgcagcat 
aggtgatccg 
tggccagtgt 
tgaaccgata 
cggacgtact 
aaacctgcat: 
aeggccgcct 
agagcgaaac 
agatcacaga 
atcecggcat: 
ccagatggtt 
tctgtttcac 
aggaggcggg 
aagcatccgc 
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ggxttcttat atgctcaaca 
ggaaccactc acacat:tatt 
t:ggtgatttt ccgggggatc 
gttttacaac gactcagtac 
at:gcac1:c:ga aaticragccaa 
aagacgtccg caatgtgtta 

■ 

tcctgccacc agccagccaa 
tctaaaaagg tgatgtgtat; 
gatgcgatga gtaaataaac 
aaccagaaag gcgggtcagg 
ctcgccgggg ccgatgttct- 
gcggccgtgc gggaagatca 
cgcgacgrga aggccatcgg 
gcggacttgg ctgtgtcegc 
agcccttacg acatacgggc 
gtcacggatg gaaggctaca 
atcggcggtg aggttgccga 
atcacgcagc gcgtgagcfca 
gaacccgagg gcgacgctgc 
ctcatttgag ttaatgaggt 
gccgtccgag cgcacgcagc 
ccatgaagcg ggtcaacttt 
cggtacgcca aggcaagacc 
agtaaatgag caaatgaata 
gaaaatcaag aacaaccagg 
ggttggccag gcgtaagcgg 
aagcccgagg aatcggcgtg 
cgcrgggtga tgacctggtg 
tcgaggcaga agacgccccg 
atcccggcaa ccgccggcag 
gcaaccagat tttttcgttc 
catggacgtg gccgttttcc 
ctacgagctt ccagacgggc 
gtgggattac gacctggtac 
ccgggaaggg aagggagaca 
caagttctgc cggcgagccg 
tcggttaaac accacgcacg 
ggtgacggta rccgagggtg 
cgggcggccg gagtacatcg 
aggcaagaac ccggacgtgc 
cggccgtttt ctctaccgcc 
gtitcaagacg atctacgaac 
cgtgcgcaag ctgatcgggt 
gcaggctggc ccgatcctag 
cggttcctaa tgtacggagc 



744 
150* 
756( 
762C 
7 630 
774D 
7800 
78^0 
7920 
7380 
8040 
8100 
8160 
8220 
8280 
6340 
8400 
8460 
8520 
8580 
8640 
8700 
S760 
8820 
8880 
8940 
9000 
BQ6Q 
9120 
91B0 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
5840 
9900 
9960 
10020 
X008Q 
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agatgctagg 
atagcacglia 
acccaaagcc 
aaggcgattt 
cctgtgcata 
ggtcgctgcg 
gaaataccgc 
ctcactgact 
gcggtaatac 
ggccagcaaa 
cgcccccctg 
ggactataaa 
accctgccgc 
catagctcac 
gtgcacgaac 
tccaacccgg 
agagcgaggt 
actagaagga 
gttggtagct 
aagcagcaga 
gggtctgacg 
t:c1:cccaat:t 
tgatagtttg 
ccgcgaagcg 
gtgatctcgc 
cgatcttctt 
gccggcaggc 
actgcgctgt 
tcgggcggcg 
ggaaccggat 
gcttttgtca 
agaatgtca-t 
ggaargatgt 
ccaggggaag 
agccttacgg 
actgcggagc 
ccaactacct 
aactttgttt 
catcgaccca 
aaaaaacagt: 



gcaaattgcc 
cattigggaac 
gtacatrtggg 
ttccgcctaa 
actgtctggc 
ctccctacgc 
acagatgcgt 
cgctgcgctc 
ggttatccac 
aggccaggaa 
acrgagcat:ca 
garaccaggc 
ttaccggata 
gccgtaggta 
cccccgttca 
taagacacga 
atgtaggcgg 
cagtatttgg 
cttgatccgg 
titacgcgcag 
ctcagtggaa 
tgtgtagggc 
gctgtgagca 
gcgtcggctt 
ctttcacgta 
cttgtccaag 
gctccattgc 
accaaatgcg 
agttccatag 
caaagagttc 
gcaagatagc 
tgcgctgcca 
cgtcgtgcac 
ccgaagtttc 
tcaccgtaac 
cgtacaaatg 
ctgatagttg 
tagggcgact 
cggcgtaacg 
cataacaagc 



Ctagcagggg 
ccaaagccgc 
aaccggtcac 
aactcttraa 
cagcgcacag 
cccgccgctt 
aaggagaaaa 
ggtcgttcgg 
agaatcaggg 
ccgtaaaaag 
caaaaaizcga 
gtttccccct 
cctgtccgcc 
tctcagttcg 
gcccgaccgc 
cttatcgcca 
tgctacagag 
tatctgcgct 
caaacaaacc 
aaaaaaagga 
cgaaaactca 
trattatgca 
attatgtgct 
gaacgaattt 
gtggacaaat 
ataagcctgt 
ccagtcggca 
ggacaacgta 
cgttaaggtt 
ctccgccgct 
cagatcaatg 
ttctccaaat 
aacaatggtg 
caaaaggtcg 
cagcaaatca 
tacggccagc 
agtcgata^:t 
gccctigctgc 
cgcttgctgc 
catgaaaacc 
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aaaaaggtcg 
acattgggaa 
acatgtaagt 
aaciitiattaa 
ccgaagaget 
cgcgiicggcc 
tiaccgcatca 
ccgcggcgag 
gataacgcag 
gccgcgttgc 
cgctcaagtc 
ggaagctccc 
tttctccctt 
gtgtaggtcg 
tgcgccttat 
ctggcagcag 
ttct-tgaag-t 
ctgctgaagc 
accgctggta 
tctcaagaag 
cgttaaggga 

* 

cgcttaaaaa 
tagtgcatct 
cragctagac 
tcttccaact 
cnagcttcaa 
gcgacatcct 
agcactacat 
tcattragcg 
ggacctacca 
tcgatcgtgg 
tgcagttegc 
acttctacag 
litgatcaaag 
atatcactgt 
aacgtcggtt 
tcggcgatca 
g-taacatcgt 
ttggatgccc 
gccactgcgt 
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aaaaggtctc 
ccggaacccg 
gactgatata 
aactcttaaa 
gcaaaaagcg 
tatcgcggcc 
ggcgctcttc 
cggtatcagc 
gaaagaacat 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 
cgggaagcgt 
ttcgctccaa 
ccggtaact:a 
ccactggtaa 
ggtggcctaa 
cagttaectt 
gcggtggtt fc 
atcctttgat 
ttttggtcat 
t:aataaaagc 
aacgcttgag 
attatttgcc 
gatctgcgcg 
gtatgacggg 
tcggcgcgat 
ttcgctcatc 
cctcaaatag 
aggcaacgct 
ctggctcgaa 
gcttagctgg 
cgcggagaat 
ctcgccgcgt: 
gfcggcttcag 
cgagatggcg 
ccgctticccc 
tgctgctcca 
gaggcataga 
tccatg 



tttcctgtgg 
tacattgg-ga 
aaagagaaaa 
acccgcctgg 
cctacGcttc 
ratgcggtgt 
cgcttcctcg 
tcactcaaag 
gtgagcaaaa 
ccataggctc 
aaacccgaca 
tcctgttccg 
ggcgctttct 
gctgggctgt 
tcgtectgag 
caggattagc 
ctacggctac 
cggaaaaaga 
ttttgtttgc 
cttttctacg 
gcatigatata 
agacttgacc 
ttaagccgcg 
gactaccttg 
cgaggccaag 
ctgatactgg 
tttgccggtt 
gccagcccag 
atcctgrtca 
atgttctctt 
gacacctgca 
ataacgccac 
ctcgctctct 
tgtttcatca 
gccgccatcc 
ctcgatgacg 
catgatgttt 
taacatcaaa 
ctgtacccca 



10140 
10200 
10260 
10320 

loaao 

10440 
10500 
10560 
10620 

i06ao 

10740 

loaoo 

10660 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

iiaao 

11940 
12000 
12060 
12120 
121B0 
12240 
12300 
12360 
12420 
12466 
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<212> DNA 
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<213> daaoNit-PRecoinbinat 
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15 



20 



25 



<220> 

<r221> 

<222> 
<223> 

<220> 
<221> 

<222> 
<223> 

<220> 
<221> 
<222> 

<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<:221> 
<222> 
<223> 



ni± s c^f e a t ur e 
(38) . . (183) 

Agrobacteriuiu right border 



terminator 
(196) . , (400) 

compleznentary 



3SS tenainator 



mi s c^f eat u r e 
(445) . . (462) 

Cleavage / reccgnition site for I-Scel endonuclease 

terminator 
(5BS) . . (644) 

complementary: nos terminator 



iftisc^f eature 
(921) (2027) 

complementary 



coding for Rhodotorula gracilis D-amino acid o:.ida 



30 



35 



40 



<220> 
<221> 
<222> 
<223> 

<220> 
<:221> 
<222> 
<223> 

<220> 

<:221> 
<222> 
<223> 



pjcomoter 
(2071) . , (3882) 

complementary: A-thaliana nitrilasa I promoter 

terminator 
(3972) , . (4176) 

complementary: 35s terminator 



misc feature 

(4251) (6248) 
comp lement ar y : 



coding for beta-glucuronidasi 



* 
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proiftoter 
(6302) , . (7619) 

con^lementar. coding £or sTPT promoter 



10 



15 



20 



30 



35 



40 



<220> 
<221> 
<:222> 
<223> 



mi s c_^f e a t ur e 
(7691) . . (7907) 
Agrobacterium left border 



45 



<400> 1€ 

aatattcaaa caaacacata cagcgcgact 
aaaccttttc acgccctttt aaatatocga 
tttacccgoc aaratatcct gtcaaaoact 
atcagatctg gcaciccggtc actggatttt 
aagtatttta caaatacaaa tacatactaa 
aaccctataa gaaccctaat tccctratct 
aatagagaga gatagatttg tagagagaga 
tgtctcacga cgttccgaac ccagattacc 
tgatggatat ctgcagaatt egccctttta 
atcaattcac tggccgtcgt tttacaacga 
aacatagatg acaccgcgcg cganaattta 
accgcgtatt aaatgtataa ttgcgggact: 
gtcatgcatt acatgtuaat tattacargc 
atcatcgcaa gaccggcaac aggattcaat 
cgatcgggga tcatccgggt otgtggcggg 
atctagagct tgggtcccgc ctacaacttc 
aacgcctcgt cgacgagctg cgcgacatcc 
gcactcgaga agccatacgc atgcacaagc 
ctgcccctgc cgagcgagag gggcgacttt 
gcctcaacgc ggggtccgcc tcgtcgtgca 
tcgetgcctt cgatcgttcc gtcgctcgag 
aggatccgct ggaccgtctc tgggttgaca 
ccgccgcaga tgacttcgcc acctggtcgg 
gacgagtcca tcgtgcatcg cttgcatggg 
ggctcggcgg cttggtcgtc gatgcccgca 
accaccaaat ccgcaccgtc gaacgcctgc 
gtcgcgccga gcttctgcag ctctcttgca 
gagagggtgt cgtaggttac gccgatagcg 
cggtaatttg gcgtgatgtc cttgtaccag 
aaccgcctcg tccccttgag ccacatggca 
ttgaaagtcg attcttccca ttttgcttgt 
gtocaattcg cgccagccca tggtgaagcg 
tcgcgcgcga gaatatgcac gctgtagcce 
ccgataacgc ctgatccgag gacaacgacg 



tatcatggac 
ttattctaat 
gatagtttaa 
ggttttagga 
gggtttctta 
gggaactact 
Gtggtgattt 
ctgttatccc 
gatcagcaca 
ctcagagctt 
tcctagtttg 
ctaatcataa 
ttaacgtaat 
cttaagaaac 
aactccacga 
gactcccgcg 
tccgccgcgc 
gtgacctcct 
gtcoggtcga 
ggtcgcaagc 
atggtcgggt 
gacaagtccc 
ggaatgatgt 
gacttgacga 
atcgactugg 
tcaagcgacg 
aggtactggc 
ccaggrggac 
tgcccgagca 
tggcccgtcg 
cgaggaccgt 
aaagtctggc 
ttccgagcga 
cgcttctgcg 



atacaaatgg 
aaacgctctt 
actgaaggcg 
attagaaa'ct 
tatgctcaac 
cacacattat 
ttgcgccggg 
tagtcgagcg 
ctggcggccg 
gacaggaggc 

cgcgctatat 
aaacccatcli 
tcaacagaaa 
tttattgcca 
aaatatccga 
ccgcgccgtg 
cccaactctg 
tctecttcgc 
gaggcaggac 
cgacgttgtg 
cgaggcgcaa 
agtctcccac 
aggcgggaga 
ggacggtttg 
cgccaagtcc 
taacggtccg 
agtactttgg 
attcggaaga 
agccgtcttc 
^ggaccaactc 
ctgtaagcgt 
tcgagacgtc 
ggatgagggc 
agtgcatggg 



acgaacggat 


60 


tcetcttagg 


120 


ggaaacgaca 


ISO 


ttattgarag 


240 


acatgagcga 


300 


tcrggagaaa 


360 


taccccaaac 


420 


gccgccagtg 


4S0 


ttactagtgg 


540 


ccgatctagt 


600 


tttgttttct 


660 


cataaataat: 


720 


ttatatgata 


780 


aatgtttgaa 


840 


acgcagcaag 


900 


gtaccgctgg 


960 


ctggt:atcc=c 


1020 


cgctcgtgcg 


lOBO 


gatccgrtct 


114 0 


gcggaggacc 


1200 


gcagtgcttg 


1260 


gccgtacgtc 


1320 


agcggggtcg 


1380 


cccgcggatt 


1440 


cgtagcgttg 


1500 


tctctcaaac 


1560 


tgcgtggacg 


1620 


tgggaggggg 


16B0 


gttctgcgcg 


1740 


gacccacttc 


1800 


catgaaaggc 


18 60 


ctccggcaag 


1920 


gctgctcaga 


1980 


ccctcgaata 


2040 
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gagtcgagat ccgataccgc ccgggctcga gtctrtgttt cctaettbgg ttcatgacac 2X00 

ccagagactt gagagaagca atatafcagac ttttttttgt tttttttttg tggrcacgtt 2160 

tatttLtccta ttggagacgg taacgaagat: cgaacctgtg gtiggaaat:ga aa^zaaggtrgg 2220 

gactagccca cgtggtttct tttctctgca ttgatttgtc tttgcttttt titgtaaagtt 2260 

5 cacatcaaac ctactaataa ttgaga^gaa aaataaaatc tattgattga tcaaaccagc 2340 

cgatgctfca tgtcrgaata caaaaaagaa gtgaaaaccc cgtttaagaa ctacaacggt 2400 

ggtttacaaa gtatttggac acaataaatc caaacgaaat aaaacaaaat ggagaactac 24 60 

caaataaaaa acaaataaaa aacttaaaag aattcattcc atttttcttc cegtagaatt 2520 

tattctttta tggattcctt aaatccatat ttgatgcatt ttgattcctc ataataggta 2580 

1Q ataatatata ctatgttata gatatgtttc taattcgtac taacctacct ttttttggtc 2640 

gtacgattct acctaataat attgaacgga attgatgttt tggaccactt agaaagtatt 2700 

ttttttt:tgg tttgtcttag ctgtatttca t:taaatataa atttaaataa gaaatgtcat 2760 

aaataaaatt tgacgtatag atttcttaaa tccatttrat gttatttaat atttgaaatg 2820 

tgagtttggc tcctatttaa tcttaggatg ggttaatact aagtttfccct taatgaatta 2880 

15 tctcagagaa actggattaa ataaactaaa aaatagatca atgtgttttg gtccggtcaa 2940 

atiatctttgg atttactatt attggcgaaa agaaagtctc atatagtaaa tcacattcct: 3000 

acaagagaaa tcaaaatttt tgaattaaca tggattgtat agtttcttat ataaecaatt 3060 

agttcgcatc aagaaaacca aaccccaatt aataatcaaa cgggcttggt aggaatattt 3120 

cattgcagct ttcagataaa agaaaaaaac acacacccaa gtcttttatt tcatctttct 3180 

20 tactfcgcagg aactcaaatt cscacttcgcc acttttcttt acaaataaac acaaatfcgtc 3240 

aatigaaacga aatagtcttt t:t:atgcaaac actgtttgtc tttiit-tcgat cacgtttctg 3300 

attgtgacag ccatccatat atatagggaa cgtaaaaeaa caacatgtga agtcacatat 3360 

. acgtaatggt ttagcatagc ttctattttc gttgtcaata ttagtcactc caaaacattt 3420 

-btaagaaaaa iiaaattaata tatgtatatt cttggaacta atgtatgbgg aaatacagca 3480 

25 acttaattat t.aaacatt:ct aaatgcaaat atgcaaagaa aaaaaagaaa agaacacaac 3540 

tgaaatcaaa gccagattca taataattgg ctacabggtt gtagaatgta gggtiaacaca 3600 

acatccagaa ttgaacactc aaa-t:t;ggatg atagaiiggat aatcttraga tacaagagaa 3660 

tcggttctct: tccattatta acgaaaataa agaaaaaaag tttagcataa aagtttgaaa 3720 

ctcaacataa cattttgaac ttgactcctt cataggagtg acatgaactg acgaatcaca 3780 

30 aecgattact tgttrgagtc atcttccgct tbctccacct tcgaaatgaa ngtgaccggt 3840 

ttcttcgggt gctcatttac ggtcaagtgt aaaacatctg gtctcgaggt acctggtagg 39QQ 

gataacaggg taatctgggt tcaaaacgtc gtgagacagt ttggtgcagg tcgaaattcg 3S60 

agctcggtae ccggtcactg gattttggtt ttaggaatta gaaattttat tgatagaagt 4020 

attttacaaa tacaaataca tactaagggt ttct:t:atat:g ctcaacacat gagcgaaacc 4060 

35 craliaagaac cctaattccc tt:atcitggga actactcaca cattattctg gagaaaaata 4140 

gagagagata gatttgtaga gagagactgg tgatttttgc gccgggtacc gagctcggta 4200 

gcaattcecg aggctgtagc cgacgatggt gcgccaggag agttgttgat tcattgtttg 4260 

cctccctgct gcggtttttc accgaagttc atgccagtcc agcgtttttg cagcagaaaa 4320 

gccgccgact tcggtttgcg gtcgcgagtg aagatccctt tcttgttacc gccaacgcgc 4380 

40 aatatgcctt gcgaggtcgc aaaatcggcg aaattccata cctgttciacc gacgacggcg 4 440 

ctgacgcgat caaagacgcg gtgatacata tccagccatg cacactgata ct.ct:t:cactc 4500 

caeatgtcgg tgtacattga gtgcagcccg gctaacgtat ccacgccgta tfccggtgaxig 4560 

ataatcggct gafcgnagttt: ctcctgccag gccagaagtt ctttttccag taccttctct 4620 

gccgtttcca aatcgccgct ttggacatae tsatccgtaat aacggttcag gcacagcaca 4680 

45 tcaaagagat cgctgatggt atcggtgtga gcgtcgcaga acattacatt gacgcaggtg 4740 
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atcggacgcg 
acttgcggac 
ttgtcacgcg 
actgcctctt 
S gagaggtliaa 
cagtcgagca 
cagtccatta 
tccgcatctt 
tgttcgccct 

10 gtctggcttt 
tgcggattca 
cccgcatcac 
gcgtggttac 
ggcgtggtgt 

15 taagactgct 
cagttcagtt 
tattagaaaa 
tiatiactacat: 
gataattalit: 

20 ataacatacg 
tcctgattat 
cgctggcctg 

I. ex e. u u g<9. ca. 

cggctttctt 

25 agactgaatg 
cgtaccatgg 
atgaaatcga 
tcgagtattt 
aat:ccagt:at 

30 g-tagagaatg 
gggtccatct 
gaagttccat 
tgaaattttg 
tattgaacgt 

35 taaaggtatc 
cttcatgctc 
cgtaaracaa 
tacaaaacaa 
aaaaatacaa 

40 gtiactcaatt 
ttctgtccat 
atgt-tcacta 
atcactacaa 
attaagtaaa 

45 gtgattgatg 



tcgggtcgag 

gggtatccgg 

cratcagckc 

cgct:gtacag 

agccgacagc 

tctcttcagc 

atgcgtggtc 

catgacgacc 

tcactgccac 

tggctgtgac 

ccactrgcaa 

gcagttcaac 

agtcttgcgc 

agagcatrac 

tttzcttgcc 

cgttgttcac 

gctataaatt 

tcttttattt 

atata-tatat 

gcgtgacatc 

tgacccacac 

cccaaccttt 

ucn-cuguauc;: 

gtaacgcgct 
cccacaggcc 
tcgatcgaet 
aattcagagt 
caagagatgg 
ctcgcggtgt 
atgtaattag 
tatcttgata 
atcaccatca 
gragatttba 
gacagattca 
tctatttaat 
atagtcatat 
ttttataatt 
aaaaatcttc 
atigaacccac 
caccctatac 
ttgaccaaat 
tttctcgatt 
atatgttata 
catgattctt 
ttattggttg 



ttcacgcgtt 

ttcgttggca 

tttaatcgcc 

ttctttcggc 

agcagtttca 

gtaagggtaa 

gtgcaccatc 
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tttttcatta 

caagaaagtia 

ttcttttgtt 

acgttattct 

ttatcaaagt 

taattttttt 

atgtattatg 

tctttttfctt 

tcaattaatt 

tgagggaacc 

gtttatactt 

cttatgaagt 

attgagngat 
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gcttccgcca 

atactccaca 

tgtaagtgcg 

ttgttgcccg 

tcaat caeca 

tgcgaggtac 

agcacgttat 

aagtagaacg 

ccgacgcgaa 

tagagataac 

gtgccttgtc 

ccattggcca 

accacggtga 

attccggcat 

gbaateacca 

atacctgcac 

cacttataaa 

atttgaaaca 

agcagaaact 

ggcgtatagc 

kgagtgaccg 

acttcgcgct 

tegt^uaaaae 

cgctgaccaa 

ttgatttcac 

ggaticcgata 

agagcaaaga 

tcggaeggct 

ggtagggtta 

tgaga-tactg 

ttttagagac 

aattttgttt 

cttaccttgt 

gattttttat 

tatggttgtc 

aggctttttt 

tatacaacta 

tgttgtagcc 

cgtttaacta 

tttttttcta 

tctgtaattg 

cgagtttaaa 

aatagtggtg 

tgattgagtg 

tattgtatta 



gtggcgaaar 

tcaccacgct 

cttgctgagt 

cbtcgaaacc 

cgatgccarg 

ggtaggagtt 

cgaatccttt 

gttngtggtt 

gcgggtagat 

cctcacccgg 

cagtitgcaac 

ccacctgcca 

tatcgtccac 

agtitaaagaa 

ttcccggcgg 

atcaacaaat 

ctacagaaaa 

ttatattact 

tacgtacact 

cgccctgatg 

catcgaaacg 

gataccagac 

tgcctggcac 

ttccacagtr 

gggttggggt 

tcgcccgggc 

gggacggact 

acgatgatet 

atggtcaatt 

gtttagattg 

acagtgaata 

ttttggaaga 

atgttgagtg 

aaairgaaatt 

tgtcagcatt 

gaatatctta 

taactaaaat 

agtttactcg 

ggattaccat 

gttccaccca 

cgatttaaaa 

tatgataaaa 

tttttgggga 

attattgtat 

gtatgtaagc 



attcccgtgc 

tgggtggttt 

tfcccccgttg 

aatgcctaaa 

ttcarctgcc 

ggccccaatc 

gccacgtaag 

aatcaggaac 

accacactct 

ttgccagagg 

cacctgttga 

gticaacagac 

ccaggtgttc 

atcatggaag 

gatagtctgc 

tttggtcata 

gcaatcgcta 

actaatraat 

tttcccggca 

ctccatcact 

cagcacgata 

gttgcccgca 

agcaattgcc 

ttcgcgatcc 

ttctacagga 

tcgactctag 

tatgaggatt 

cgatttggtt 

tcatctaacg 

gatgagtgta 

ttagccaatc 

taataatgai: 

gtcttcaaat 

ttacrtattt 

tgacttgcag 

ttacttttrtfc 

tagggttttc 

taagttacaa 

gtactttcat 

atctataaaa 

ttaatattac 

atgttgaccc 

taattgauga 

gtaaacctat 

aaagatgatt 
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gttcttatga ggtaatttgt tactcatcca tccttctgca tatgagaaat tgtgttagcg 
tacgcaaaac aatagagaac ataaaagata tgtgtattta .tttaaggtga cttfctgttaa 
tgatatcgta gtatctatac atttaCatat aacttgttga atttgagtat aagctateag 
gatecggggg atcctctaga gtcgaggtac cgagctcgaa ttcactggcc gtcgttttac 
aacgactcag tactgcttgg taataattgt cattagattg tttttatgca tagatgcact 
cgaaatcagc caatttcaga caagtatoaa acggatgtca attcagcaca ttaaagacgt 
ccgcaatgrg ttattaagtt gtctaagcgt caatttgttt acaecacaat atatcctgcc 
acoagccagc caacagcccc ccgaccggca gctcggcaca aaatcaccac gcgtccaaaa 
aggtgatgtg tatttgagta aaacagcttg cgtcatgcgg tcgctgcgta tatgatgcga 
tgagtaaata aacaaacacg caaggggaac gcatgaaggt tatcgccgta cttaaccaga 
aaggcgggtc aggcaagaeg accaccgcaa cccatctagc ccgcgccctg caactcgccg 
gggccgatgt tctgttagtc gatrccgatc cccagggcag tgcccgcgat tgggcggccg 
tgcgggaaga tcaaecgcta accgttgtcg gcatcgaccg ccegacgatt gaccgcgacg 
tgaaggccat cggccggcgc gacttcgtag tgatcgacgg agcgccccag gcggcggact 
tggctgcgtc cgcgatcaag geagccgact tcgtgccgat tecggtgcag ccaagccctt 
acgacatatg ggccaccgcc gacctggtgg agctggttaa gcagcgcatt gaggtcacgg 
atggaaggct acaagcggcc cttgtcgtgt cgcgggcgat caaaggcacg cgcatcggcg 
gcgaggttgc cgaggcgctg gccgggtacg agctgcccat tcttgagtcc cgtatcacgc 
agcgcgtgag ctacccaggc actgcogccg ccggcacaac cgttcttgaa tcagaacccg 
agggcgacgc tgcccgcgag gtccaggcgc tggccgctga aattaaatea aaactcattt 
gagttaatga ggtaaagaga aaargageaa aagcacaaac acgctaagtg ccggccgtcc 
gagcgcacgc agcagcaagg ctgcaacgtt ggccagcctg gcagacacgc cagccatgaa 
gcgggtcaac tttcagttgc cggcggagga tcacaccaag ctgaagargt acgcggta«=g 
ccaaggcaag accattaccg agctgctatc tgaacacate gcgcagctac cagagtaaat 
gagcaaatga ataaatgagt agatgaattt tagcggctaa aggaggcggc atggaaaatc 
aagaacaacc aggcaccgac gccgtggaat gccccatgtg tggaggaacg ggcggttggc 
caggcgtaag eggetgggtc gtctgccggc ectgcaatgg cactggaacc cccaagcccg 
aggaatcggc gtgagcggtc gcaaaccatc cggcccggca caaatcggcg eggcgctggg 
tgatgacctg gtggagaagt tgaaggccgc gcaggccgcc cagcggcaac gcatcgaggc 
agaagacgcc ccggtgaatc gtggcaaggg gccgctgatc gaateogcaa agaatcccgg 
caaccgccgg cagccggtgc gccgtcgatt aggaagccgc ccaagggcga cgagcaacca 
gattttttcg ttccgatgct ctatgacgtg ggcacccgcg acagtcgcag caccatggac 
gtggccgttt tccgtctgtc gaagegtgae cgaegagctg gcgaggtgat ecgctacgag 
ettecagacg ggcacgtaga ggtttccgca gggccggccg gcatggccag tgtgtgggat 
tacgacctgg tactgatggc ggtttcecat ctaaccgaat ccatgaaccg ataccgggaa 
gggaagggag acaagcccgg ccgcgtgtto cgtccacacg ttgcggacgt acteaagttc 
tgccggegag ccgatggcgg aaagcagaaa gacgaoctgg tagaaacctg cattcggtta 
aacaccacgc acgttgccat gcagcgtacg aagaaggcca agaacggccg cctggtgacg 
gtatccgagg gtgaagcctt gattagccgc tacaagatcg taaagagcga aaccgggcgg 
ccggagtaea tcgagatcga gctagctgat tggatgtacc gcgagatcac agaaggcaag 
aacccggacg tgctgacggt tcaccccgat tactttttga tcgatcccgg categgccgt 
tttctctacc gcctggcacg ecgcgccgca ggcaaggcag aagccagatg gttgttcsaag 
acgatctacg aacgcagtgg csagcgccgga gagttcaaga agttctgttt caccgtgcgc 
aagctgatcg ggtcaaatga cctgccggag tacgatctga aggaggaggc ggggcaggct 
ggcccgatcc tagtcatgcg ctaccgcaac ctgaccgagg gcgaagcatc cgccggttcc 
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taatgtacgg agcagatgct agggcaaact 
ctctttcctg tggatagcac gtacattggg 
ccgtacattg ggaacccaaa gccgtacatt 
ataaaagaga aaaaaggcga tttttccgcc 
S aaaacccgcc tggcctgtgc ataaccgtct 
gcgcctaccc ttcggtcgct gcgctcccta 
gcctatgcgg tgtgaaatac cgcacagatg 
ttccgcttcc tcgctcactg actcgctgcg 
agctcactca aaggcggtaa tacggrtatc 
10 eatgtgagca aaaggccagc aaaaggccag 
tttccatagg ct ccgccccc ctgacgagca 
gcgaaacccg acaggactat aaagatacca 
ctctcctgtt ccgaceetgc cgcttaecgg 
cgtggcgctt tctcatagct cacgctgtag 

IS caagctgggc tgtgtgcacg aaccccccgt 
ctatcgtctt: gagtccaacc cggliaagaca 
taacaggatt agcagagcga ggtatgtagg 
taactacggc tacactagaa ggacagtatt 
cttcggaaaa agagttggta gctcttgatc 

20 tttttttgtt tgcaagcagc agattacgcg 
gatcttttct acggggrctg acgctcagtg 
catgcatgat atatctccca atttgtgtag 
agcagacttg accrgatagt ttggctgtga 
gagttaagcc gcgccgcgaa gcggcgtcgg 

25 gccgactacc ttggtgatct cgcctttcac 
gcgcgaggcc aagcgatctt crtcttgtcc 
gggctgatac tgggccggca ggcgctccat 
gattttgccg gttactgcgc tgtaccaaat 
atcgccagcc cagtcgggcg gcgagttcca 

30 tagatcctgt tcaggaaccg gatcaaagag 
gctatgttct cttgcttttg tcagcaagat 
gaagatacct gcaagaatgt cattgcgctg 
tggataacgc cacggaatga tgtcgtcgtg 
aatctcgctc tctccagggg aagccgaagt 

35 cgttgtttca tcaagcctta cggtcaccgt 
caggccgcca tccactgcgg agccgtacaa 
gcgctcgatg acgccaacta cccctgatag 
ccccatgatg tttaactttg ttttagggcg 
ccataacatc aaacatcgac ccacggcgta 

40 agactgtacc ccaaaaaaac agtcataaca 
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gccctagcag gggaaaaagg tcgaaaaggt. 10200 

aacccaaagc cgtacattgg gaaccggaac 102 60 

gggaaccggt cacacatgta agtgactgat 10320 

taaaactctt taaaacttat taaaact:ctt 103B0 

ggccagcgca cagccgaaga gctgcaaaaa 10440 

cgccccgccg cttcgcgtcg gcctatcgcg 10500 

cgtaaggaga aaataccgca tcaggcgctc 10560 

ctcggtcgtt cggctgcggc gagcggtatc 10620 

cacagaatca ggggat^aacg caggaaagaa 10 680 

gaaccgtaaa aaggccgcgt tgctggcgtt 10740 

tcacaaaaat cgacgctcaa gtcagaggtg 108 00 

ggcgtttccc cctggaagct ccctcgtgcg 10860 

atacctgtcc gcctttctcc cttcgggaag 10920 

gtatctcagt tcggtgtagg tcgttcgccc 109 BO 

tcagcccgac cgctgcgcct tatccggtaa 1104 0 

cgacttatcg ccactggcag cagccactgg 11100 

cggtgctaea gagttcttga agtggtggcc 11160 

tggtatctgc gctctgctga agccagttac 11220 

cggcaaacaa accaccgctg gtagcggtgg 112 BO 

cagaaaaaaa ggatctcaag aagatccttt X1340 

gaacgaaaac tcacgttaag ggattttggt 114 00 

ggcttattat gcacgcttaa aaataataaa 114 60 

gcaattatgt gcttagtgca tctaacgctt 11520 

cttgaacgaa tttctagcta gacattattt 11580 

gtagtggaca aattcttcca actgatotgc 11640 

aagataagcc tgtctagctt caagtatgac ' 11700 

tgcccagtcg gcagcgacat ccttcggcgc 11760 

gcgggacaac gtaagcacta catttcgctc 11620 

tagcgttaag gtttcattta gcgcctcaaa 11860 

ttcctccgcc gctggaccta ccaaggcaac 11940 

agccagatcsa atgtcgatcg tggctggctc 12000 1 

ccattctcca aattgcagtt cgcgcttagc 120 60 I 

cacaacaatg gtgacttcta cagcgcggag 12120 I 

ttccaaaagg tcgttgatca aagctcgccg 12160 I 

aaccagcaaa ticaatatcac tgtgtggctt 12240 I 

atgtacggcc agcaacgtcg gttcgagatg 12300 I 

ttgagtcgat acttcggcga ccaccgcttc 12360 I 

actgccctgc tgcgtaacat cgttgctgct 12420 I 

acgcgcttgc tgcttggatg cccgaggcat 12480 I 

agccatgaaa accgccactg cgctccatg 12533 I 
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OXDA_C AE EL/303-321 
OXDA_FUS SO/325-343 
OXDA_HUMAN/3 05-323 
OXDA_MOUSE/304-322 
OXDA_PIG/305-323 
OXDA_RAB I T / 3 0 5 -3 2 3 
OXDA_i^T / 3 0 4 - 3 2 2 
OXDA__RHOTO/3 27-345 
OXDA_TRI VR/ 3 22-340 
OXDD_BOVIN/3 00-318 
OXDD HUMAN/300-318 
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